JC10 Rec'd PCT/PTO 25 JAN 2002



FORM PTO-1390
(REV. 12-2001) 



U.S. DEPARTMENT OF COMMERCE PATENT AND TRADEMARK OFFICE 



TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



ATTORNEY'S DOCKET NUMBER 

22221/1023 



U.S. APPLICATION NO. (If known, see 37 CFR 1.5)

10/048071 



INTERNATIONAL APPLICATION NO. 
PCT/US00/20666 



INTERNATIONAL FILING DATE 
28 July 2000 (28.07.00) 



PRIORITY DATE CLAIMED 
29 July 1999 (29.07.99) 



DNA REPLICATION PROTEINS OF GRAM POSITIVE BACTERIA AND THEIR USE TO SCREEN FOR CHEMICAL INHIBITORS



APPLICANT(S) FOR DO/EO/US
O'DONNELL, Michael E.; BRUCK, Irina; ZHANG, Dan; and WHIPPLE, Richard



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 

1. [X] This is a FIRST submission of items concerning a filing under 35 U.S.C. 371.

2. [ ] This is a SECOND or SUBSEQUENT submission of items concerning a filing under 35 U.S.C. 371.

3. [X] This is an express request to begin national examination procedures (35 U.S.C. 371(f)). The submission must include
items (5), (6), (9) and (21) indicated below.

4. [X] The US has been elected by the expiration of 19 months from the priority date (Article 31).

5. [X] A copy of the International Application as filed (35 U.S.C. 371(c)(2))

a. [ ] is attached hereto (required only if not communicated by the International Bureau).

b. [ ] has been communicated by the International Bureau.

c. [X] is not required, as the application was filed in the United States Receiving Office (RO/US).

6. [ ] An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)).

a. [ ] is attached hereto.

b. [ ] has been previously submitted under 35 U.S.C. 154(d)(4).

7. [ ] Amendments to the claims of the International Application under PCT Article 19 (35 U.S.C. 371(c)(3))

a. [ ] are attached hereto (required only if not communicated by the International Bureau).

b. [ ] have been communicated by the International Bureau.

c. [ ] have not been made; however, the time limit for making such amendments has NOT expired.

d. [ ] have not been made and will not be made.

8. [ ] An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371(c)(3)).

9. [ ] An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)).

10. [ ] An English language translation of the annexes of the International Preliminary Examination Report under PCT
Article 36 (35 U.S.C. 371(c)(5)).

Items 11 to 20 below concern document(s) or information included: 

11. [ ] An Information Disclosure Statement under 37 CFR 1.97 and 1.98.

12. [ ] An assignment document for recording. A separate cover sheet in compliance with 37 CFR 3.28 and 3.31 is included.

13. [ ] A FIRST preliminary amendment.

14. [ ] A SECOND or SUBSEQUENT preliminary amendment.

15. [ ] A substitute specification.

16. A change of power of attorney and/or address letter.

17. A computer-readable form of the sequence listing in accordance with PCT Rule 13ter.2 and 35 U.S.C. 1.821 - 1.825.

18. A second copy of the published international application under 35 U.S.C. 154(d)(4).

19. A second copy of the English language translation of the international application under 35 U.S.C. 154(d)(4).

20. [X] Other items or information:

UNSIGNED Combined Declaration and Power of Attorney



page 1 of 2 



U.S. APPLICATION NO. (If known, see 37 CFR 1.5)

INTERNATIONAL APPLICATION NO.
PCT/US00/20666



ATTORNEY'S DOCKET NUMBER 

22221/1023 



21. [X] The following fees are submitted:

BASIC NATIONAL FEE (37 CFR 1.492(a)(1)-(5)):

Neither international preliminary examination fee (37 CFR 1.482)
nor international search fee (37 CFR 1.445(a)(2)) paid to USPTO
and International Search Report not prepared by the EPO or JPO              $1040.00

International preliminary examination fee (37 CFR 1.482) not paid to
USPTO but International Search Report prepared by the EPO or JPO            $890.00

International preliminary examination fee (37 CFR 1.482) not paid to USPTO
but international search fee (37 CFR 1.445(a)(2)) paid to USPTO             $740.00

International preliminary examination fee (37 CFR 1.482) paid to USPTO
but all claims did not satisfy provisions of PCT Article 33(1)-(4)          $710.00

International preliminary examination fee (37 CFR 1.482) paid to USPTO
and all claims satisfied provisions of PCT Article 33(1)-(4)                $100.00

                                                            CALCULATIONS    PTO USE ONLY

ENTER APPROPRIATE BASIC FEE AMOUNT =                            710.00

Surcharge of $130.00 for furnishing the oath or declaration
later than [ ] 20 [X] 30 months from the earliest claimed
priority date (37 CFR 1.492(e)).                                130.00

CLAIMS                NUMBER FILED    NUMBER EXTRA    RATE
Total claims          91 - 20 =       71              x $18.00  1,278.00
Independent claims       - 3 =                        x $84.00  0.00
MULTIPLE DEPENDENT CLAIM(S) (if applicable)           + $280.00 0.00

TOTAL OF ABOVE CALCULATIONS =                                   2,118.00

[X] Applicant claims small entity status. See 37 CFR 1.27.
The fees indicated above are reduced by 1/2.                    1,059.00

SUBTOTAL =                                                      1,059.00

Processing fee of $130.00 for furnishing the English translation
later than [ ] 20 [ ] 30 months from the earliest claimed
priority date (37 CFR 1.492(f)).                                0.00

TOTAL NATIONAL FEE =                                            1,059.00

Fee for recording the enclosed assignment (37 CFR 1.21(h)).
The assignment must be accompanied by an appropriate cover
sheet (37 CFR 3.28, 3.31). $40.00 per property +                0.00

TOTAL FEES ENCLOSED =                                           1,059.00

Amount to be refunded:
Amount to be charged:
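
The fee figures entered above can be cross-checked with a short calculation. The following is only an illustrative sketch: the function name and parameter names are assumptions introduced here, and the amounts are simply the ones entered on this form (basic fee 710.00, surcharge 130.00, 91 total claims with 20 included, $18.00 per extra claim, small entity reduction of 1/2).

```python
from decimal import Decimal


def national_fee(basic_fee, surcharge, total_claims, claims_included,
                 rate_per_extra_claim, small_entity):
    """Hypothetical helper mirroring the arithmetic of the fee table above."""
    # Claims beyond the number covered by the basic fee are charged per claim.
    extra_claims = max(total_claims - claims_included, 0)
    claims_fee = extra_claims * rate_per_extra_claim
    total = basic_fee + surcharge + claims_fee
    # Small entity status halves the total of the above calculations.
    due = total / 2 if small_entity else total
    return extra_claims, claims_fee, total, due


extra, claims_fee, total, due = national_fee(
    basic_fee=Decimal("710.00"),
    surcharge=Decimal("130.00"),
    total_claims=91,
    claims_included=20,
    rate_per_extra_claim=Decimal("18.00"),
    small_entity=True,
)
print(extra, claims_fee, total, due)
```

With the values from this form, the sketch reproduces the entries for extra claims (71), the claims fee (1,278.00), the total of the above calculations (2,118.00), and the total national fee (1,059.00).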



a. [X] A check in the amount of $ 1,059.00 to cover the above fees is enclosed.

b. [ ] Please charge my Deposit Account No.          in the amount of $          to cover the above fees.
A duplicate copy of this sheet is enclosed.

c. [X] The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any
overpayment to Deposit Account No. 14-1138. A duplicate copy of this sheet is enclosed.

d. [ ] Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card
information should not be included on this form. Provide credit card information and authorization on PTO-2038.



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1.137 (a) or (b)) must be filed and granted to restore the application to pending status. 



SEND ALL CORRESPONDENCE TO:

Edwin V. Merkel
Nixon Peabody LLP
Clinton Square
P.O. Box 31051
Rochester, NY 14603-1051
United States of America

SIGNATURE

Edwin V. Merkel
NAME

40,087
REGISTRATION NUMBER

FORM PTO-1390 (REV 12-2001) page 2 of 2



PATENT 

Docket No.: 22221/1023 (RU-429) 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Applicants : O'Donnell et al. 
Serial No. : 10/048,071 
Cnfrm. No. : 1435 
Filed : July 28, 2000 

For : DNA REPLICATION PROTEINS OF GRAM 

POSITIVE BACTERIA AND THEIR USE TO 
SCREEN FOR CHEMICAL INHIBITORS 



Examiner: 
To Be Assigned 

Art Unit: 
To Be Assigned 



STATEMENT IN ACCORDANCE WITH 37 C.F.R. § 1.821(f)-(g) 

U.S. Patent and Trademark Office 
P.O. Box 2327 
Arlington, VA 22202 
Box: PCT 

Sir: 

In accordance with 37 C.F.R. § 1.821(g), applicants hereby submit a Sequence 
Listing on a computer readable 3.5" Diskette. In accordance with 37 C.F.R. § 1.821(f), 
applicants confirm that the contents of the Sequence Listing in paper form (previously 
submitted) and in computer readable form (herewith) are the same. This submission contains 
no new matter. 

Respectfully submitted, 



Dated: [illegible]




Edwin V. Merkel 
Registration No. 40,087 



NIXON PEABODY LLP 
Clinton Square, P.O. Box 31051 
Rochester, New York 14603-1051 
Telephone: (585)263-1128 
Facsimile: (585)263-1600 



Certificate of Mailing - 37 CFR 1

I hereby certify that this correspondence is being
deposited with the United States Postal Service as
first class mail in an envelope addressed to:
U.S. Patent and Trademark Office, P.O. Box 2327,
Arlington, VA 22202




R624413.1 



WO 01/09164 



-1- 



PCT/US00/20666 



DNA REPLICATION PROTEINS OF GRAM POSITIVE BACTERIA AND
THEIR USE TO SCREEN FOR CHEMICAL INHIBITORS

The present application is a continuation-in-part of U.S. Patent 
Application Serial No. 09/235,245 filed January 22, 1999, which claims benefit of 
U.S. Provisional Patent Application Serial No. 60/093,727 filed July 22, 1998, and 
U.S. Provisional Patent Application Serial No. 60/074,522 filed January 22, 1998, all 
of which are hereby incorporated by reference. The present application also claims 
benefit of U.S. Provisional Patent Application Serial No. 60/146,178 filed July 29,
1999, which is hereby incorporated by reference.

The present invention was made with funding from National Institutes 
of Health Grant No. GM38839. The United States Government may have certain 
rights in this invention. 

FIELD OF THE INVENTION 

This invention relates to genes and proteins that replicate the 
chromosome of Gram positive bacteria. These proteins can be used in sequencing, 
amplification of DNA, and in drug discovery to screen large libraries of chemicals for 
identification of compounds with antibiotic activity. 

BACKGROUND OF THE INVENTION 

All forms of life must duplicate their genetic material to propagate the
species. The process by which the DNA in a chromosome is duplicated is called
replication. The replication process is performed by numerous proteins that
coordinate their actions to duplicate the DNA smoothly. The main protein actors are
as follows (reviewed in Kornberg et al., DNA Replication, Second Edition, New
York: W.H. Freeman and Company, pp. 165-194 (1992)). A helicase uses the energy
of ATP hydrolysis to unwind the two DNA strands of the double helix. Two copies of
the DNA polymerase use each "daughter" strand as a template to convert them into
two new duplexes. The DNA polymerase acts by polymerizing the four monomer unit
building blocks of DNA (the 4 dNTPs, or deoxynucleoside triphosphates: dATP,
dCTP, dGTP, and dTTP). The polymerase rides along one strand of DNA, using it as a
template that dictates the sequence in which the monomer blocks are to be
polymerized. Sometimes the DNA polymerase makes a mistake and includes an
incorrect nucleotide (e.g., A instead of G). A proofreading exonuclease examines the
polymer as it is made and excises building blocks that have been improperly inserted
in the polymer.
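
The template-directed copying and proofreading described above can be pictured with a minimal sketch. It is a deliberate simplification offered for illustration only: the function names and the `errors` argument are hypothetical, strand polarity is ignored (bases are paired position by position), and proofreading is applied after synthesis rather than concurrently with it.

```python
# Watson-Crick pairing rules for the four dNTP building blocks.
PAIR = {"A": "T", "T": "A", "G": "C", "C": "G"}


def polymerize(template, errors=None):
    """Copy a template base by base.

    `errors` is a hypothetical map of position -> wrongly inserted base,
    used here to imitate an occasional polymerase mistake.
    """
    errors = errors or {}
    return "".join(errors.get(i, PAIR[base]) for i, base in enumerate(template))


def proofread(template, product):
    """Replace every base that does not pair with the template.

    Returns the corrected strand and the number of bases excised.
    """
    corrected = []
    excised = 0
    for t, p in zip(template, product):
        if p != PAIR[t]:
            excised += 1      # the improperly inserted base is removed
            p = PAIR[t]       # and the base dictated by the template is inserted
        corrected.append(p)
    return "".join(corrected), excised


template = "GATTACA"
raw = polymerize(template, errors={2: "G"})   # one misincorporation at position 2
fixed, n_excised = proofread(template, raw)
print(raw, fixed, n_excised)
```

In this sketch the template alone dictates the product sequence, and the proofreading step restores the single mismatch, which is the division of labor between polymerase and exonuclease that the paragraph describes.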

Duplex DNA is composed of two strands that are oriented antiparallel
to one another, one being oriented 3' to 5' and the other 5' to 3'. As the helicase
unwinds the duplex, the DNA polymerase moves continuously forward with the
helicase on one strand (called the leading strand). However, because DNA
polymerases can only extend the DNA forward from a 3' terminus, the polymerase on
the other strand (called the lagging strand) extends DNA in the direction opposite
to DNA unwinding. This necessitates a discontinuous ratcheting motion on the lagging
strand in which the DNA is made as a series of Okazaki fragments. DNA
polymerases cannot initiate DNA synthesis de novo, but require a primed site (i.e., a
short duplex region). This job is fulfilled by primase, a specialized RNA polymerase
that synthesizes short RNA primers on the lagging strand. The primed sites are
extended by DNA polymerase. A single-stranded DNA binding protein ("SSB") is
also needed; it operates on the lagging strand. The function of SSB is to coat single
stranded DNA ("ssDNA"), thereby melting short hairpin duplexes that would
otherwise impede DNA synthesis by DNA polymerase.

The replication process is best understood for the Gram negative
bacterium Escherichia coli and its bacteriophages T4 and T7 (reviewed in Kelman et
al., "DNA Polymerase III Holoenzyme: Structure and Function of Chromosomal
Replicating Machine," Annu. Rev. Biochem., 64:171-200 (1995); Marians, K.J.,
"Prokaryotic DNA Replication," Annu. Rev. Biochem., 61:673-719 (1992); McHenry,
C.S., "DNA Polymerase III Holoenzyme: Components, Structure, and Mechanism of
a True Replicative Complex," J. Biol. Chem., 266:19127-19130 (1991); Young et al.,
"Structure and Function of the Bacteriophage T4 DNA Polymerase Holoenzyme,"
Am. Chem. Soc., 31:8675-8690 (1992)). The eukaryotic systems of yeast
(Saccharomyces cerevisiae) (Morrison et al., "A Third Essential DNA Polymerase in
S. cerevisiae," Cell, 62:1143-51 (1990)) and humans (Bambara et al., "Reconstitution
of Mammalian DNA Replication," Prog. Nuc. Acid Res., 51:93-123 (1995)) have
also been characterized in some detail, as have herpes virus (Boehmer et al., "Herpes
Simplex Virus DNA Replication," Annu. Rev. Biochem., 66:347-384 (1997)) and
vaccinia virus (McDonald et al., "Characterization of a Processive Form of the
Vaccinia Virus DNA Polymerase," Virology, 234:168-175 (1997)). The helicase of E.
coli is encoded by the dnaB gene and is called the DnaB-helicase. In phage T4, the
helicase is the product of gene 41, and, in T7, it is the product of gene 4.
Generally, the helicase contacts the DNA polymerase in E. coli. This contact is
necessary for the helicase to achieve the catalytic efficiency needed to replicate a
chromosome (Kim et al., "Coupling of a Replicative Polymerase and Helicase: A tau-
DnaB Interaction Mediates Rapid Replication Fork Movement," Cell, 84:643-650
(1996)). The identity of the helicase that acts at the replication fork in a eukaryotic
cellular system is still not firmly established.

The primases of E. coli (product of the dnaG gene), phage T4 (product
of gene 61), and T7 (gene 4) require the presence of their cognate helicase for activity.
The primase of eukaryotes, called DNA polymerase alpha, looks and behaves 
differently. DNA polymerase alpha is composed of 4 subunits. The primase activity 
is associated with the two smaller subunits, and the largest subunit is the DNA 
polymerase which extends the product of the priming subunits. DNA polymerase 
alpha does not need a helicase for priming activity on single strand DNA that is not 
coated with binding protein. 

The chromosomal replicating DNA polymerases of all these systems,
prokaryotic and eukaryotic, share the feature that they are processive, meaning they
remain continuously associated with the DNA template as they link monomer units
(dNTPs) together. This catalytic efficiency can be manifested in vitro by their ability to
extend a single primer around a circular ssDNA of over 5,000 nucleotide units in 
length. Chromosomal DNA polymerases will be referred to here as replicases to 
distinguish them from DNA polymerases that function in other DNA metabolic 
processes and are far less processive. 

There are three types of replicases known thus far that differ in how
they achieve processivity and how their subunits are organized. These will be referred
to here as Types I-III. Type I is exemplified by the phage T5 replicase, which is
composed of only one subunit yet is highly processive (Das et al., "Mechanism of
Primer-template Dependent Conversion of dNTP-dNMP by T7 DNA Polymerase," J.
Biol. Chem., 255:7149-7154 (1980)). It is possible that the T5 enzyme achieves
processivity by having a cavity within it for binding DNA, with a domain of the
protein acting as a lid that opens to accept the DNA and closes to trap the DNA inside,
thereby keeping the polymerase on DNA during polymerization of dNTPs. Type II is
exemplified by the replicases of phage T7, herpes simplex virus, and vaccinia virus.
In these systems, the replicase is composed of two subunits, the DNA polymerase and
an "accessory protein" which is needed for the polymerase to become highly efficient.
It is presumed that the DNA polymerase binds the DNA in a groove and that the
accessory protein forms a cap over the groove, trapping the DNA inside for processive
action. Type III is exemplified by the replicases of E. coli, phage T4, yeast, and
humans, in which there are three separate components: a sliding clamp protein, a
clamp loader protein complex, and the DNA polymerase. In these systems, the sliding
clamp protein is an oligomer in the shape of a ring. The clamp loader is a
multiprotein complex which uses ATP to assemble the clamp around DNA. The
DNA polymerase then binds the clamp, which tethers the polymerase to DNA for high
processivity. The replicase of the E. coli system contains a fourth component called
tau that acts as a glue to hold two polymerases and one clamp loader together into one
structure called Pol III*. In this application, any replicase that uses a minimum of
three components (i.e., clamp, clamp loader, and DNA polymerase) will be referred to
as either a three component polymerase, a Type III enzyme, or a DNA polymerase III-
type replicase.

The E. coli replicase is also called DNA polymerase III holoenzyme.
The holoenzyme is a single multiprotein particle that contains all the components; it is
composed of ten different proteins. This holoenzyme is suborganized into four
functional components called: 1) Pol III core (DNA polymerase); 2) gamma complex
or tau/gamma complex (clamp loader); 3) beta subunit (sliding clamp); and 4) tau
(glue protein). The DNA polymerase III "core" is a tightly associated complex
containing one each of the following three subunits: 1) the alpha subunit is the actual
DNA polymerase (129 kDa); 2) the epsilon subunit (28 kDa) contains the
proofreading 3'-5' exonuclease activity; and 3) the theta subunit has an unknown
function. The gamma complex is the clamp loader and contains the following
subunits: gamma, delta, delta prime, chi and psi (U.S. Patent No. 5,583,026 to
O'Donnell). Tau can substitute for gamma, as can a tau/gamma heterooligomer. The
beta subunit is a homodimer and forms the ring shaped sliding clamp. These
components associate to form the holoenzyme, and the entire holoenzyme can be
assembled in vitro from 10 isolated pure subunits (U.S. Patent No. 5,583,026 to
O'Donnell; U.S. Patent No. 5,668,004 to O'Donnell). The E. coli dnaX gene encodes
both tau and gamma. Tau is the product of the full gene. Gamma is the product of the
first 2/3 of the gene; it is truncated by an efficient translational frameshift that results
in incorporation of one unique residue followed by a stop codon.

The tau subunit, encoded by the same gene that encodes gamma
(dnaX), also acts as a glue to hold two cores together with one gamma complex. This
subassembly is called DNA polymerase III star (Pol III*). One beta ring interacts
with each core in Pol III* to form DNA polymerase III holoenzyme.

During replication, the two cores in the holoenzyme act coordinately to
synthesize both strands of DNA in a duplex chromosome. At the replication fork,
DNA polymerase III holoenzyme physically interacts with the DnaB helicase through
the tau subunit to form a yet larger protein complex termed the "replisome" (Kim et
al., "Coupling of a Replicative Polymerase and Helicase: A tau-DnaB Interaction
Mediates Rapid Replication Fork Movement," Cell, 84:643-650 (1996); Yuzhakov et
al., "Replisome Assembly Reveals the Basis for Asymmetric Function in Leading and
Lagging Strand Replication," Cell, 86:877-886 (1996)). The primase repeatedly
contacts the helicase during replication fork movement to synthesize RNA primers on
the lagging strand (Marians, K.J., "Prokaryotic DNA Replication," Annu. Rev.
Biochem., 61:673-719 (1992)).

Intensive subtyping of prokaryotic cells has now led to a taxonomic
classification of prokaryotic cells as eubacteria (true bacteria) to distinguish them
from archaebacteria. Within eubacteria are many different subcategories of cells,
although they can broadly be subdivided into Gram positive-like and Gram negative-like
cells. Numerous complete and partial genome sequences of prokaryotes have
appeared in the public databases.

In the present invention, new genes from the Gram positive bacteria,
Streptococcus pyogenes (e.g., S. pyogenes) and Staphylococcus aureus (e.g., S.
aureus) are identified. They are assigned names based on their nearest homology to
subunits in the E. coli system. The genes encoding E. coli replication proteins are as
follows: alpha (dnaE); epsilon (dnaQ); theta (holE); tau (full length dnaX); gamma
(frameshift product of dnaX); delta (holA); delta prime (holB); chi (holC); psi (holD);
beta (dnaN); DnaB helicase (dnaB); and primase (dnaG).

Study of the organisms for which a complete genome sequence is
available reveals that no organism has identifiable homologies to all the subunits of
the E. coli three component polymerase, Pol III holoenzyme (see Table 1 below). All
other organisms lack the θ subunit (holE), and all except one lack genes encoding
the χ and ψ subunits (holC and holD, respectively) as judged by sequence comparison
searches. Further, the α and ε subunits are fused into one large α subunit in some
organisms (e.g., Gram positive cells) as detailed in Sanjanwala et al., "DNA
Polymerase III Gene of Bacillus subtilis," Proc. Natl. Acad. Sci. USA, 86:4421-4424
(1989). Although all organisms have homologues to τ, β, δ' and SSB, the δ subunit has
diverged significantly (either not recognized or nearly not recognized by gene
searching programs), perhaps even to the point where it is no longer involved in DNA
replication. The dnaX gene also appears to lack frameshift signals in most
organisms. This predicts that only one protein (tau) will be produced from this gene,
instead of two as in E. coli. Indeed, this has been shown to be true for the
Staphylococcus aureus DnaX (U.S. Patent Application Serial No. 09/235,245, which is
hereby incorporated by reference). Finally, genetic study of Bacillus subtilis identified
two genes that do not have counterparts in E. coli (dnaB, not the helicase, and dnaD) as
well as one other gene, dnaI, that is only very distantly related to E. coli dnaC
(Karamata et al., "Isolation and Genetic Analysis of Temperature-Sensitive Mutants of
B. subtilis Defective in DNA Synthesis," Molec. Gen. Genet., 108:277-287 (1970);
Braund et al., "Nucleotide Sequence of the Bacillus subtilis dnaD Gene," Microb.,
141:321-322 (1995); Hoshino et al., "Nucleotide Sequence of Bacillus subtilis dnaB: A
Gene Essential for DNA Replication Initiation and Membrane Attachment," Proc.
Natl. Acad. Sci. USA, 84:653-657 (1987)). Keeping in mind the apparently random,
or at least unpredictable, process of evolution, it is possible that these apparently new
genes perform novel functions that may result in a new type of polymerase for
chromosomal replication. Thus, it seems possible that new proteins may have evolved
to take the place of χ, ψ, θ, the frameshift product of DnaX, and possibly δ in other
eubacteria. These considerations indicate that the three component polymerase of
different eubacteria may have different structures. That this may be so would not be
surprising, as different bacteria are often less related evolutionarily than plants are to
humans. For example, the split between Gram positive and Gram negative bacteria
occurred about 1.2 billion years ago. This distant split makes Gram positive cells an
attractive source to examine how different other eubacterial three component
polymerases are from the E. coli Pol III holoenzyme.

Table 1 



Organism (Order) 


X 


(g e 




a 




dnaX 




5 




Escherichia coli 
Proteobacteria 




+ + 


+ 


+ 


+ 




+ 






Haemophilus influenzae 
Proteobacteria 


+ 


+ 




+ 


+ 


+ 








Mycoplasma genitalium 
Firmicutes 


- 


- 


- 


+ 


+






+ 


(weak) 


Synechocystis sp.
Cyanobacteria 


- 


- 


- 


+ 


+ 




+ 




(weak) 


Bacillus subtilis 
Firmicutes 








+ 


+ 


+ 


+ 


+ 


(weak) 


Borrelia burgdorferi 
Spirochaetales 








+ 




+ 


+ 


+ 


(weak) 


Aquifex aeolicus 
Aquificales 








+ 


+ 




+ 




(weak) 


Mycobacterium tuberculosis 
Firmicutes & Actinobacteria 






+ 


+ 


+ 




+ 


+ 


(weak) 


Treponema pallidum 
Spirochaetales 






+ 


+ 


+ 




+ 


+ 


(weak) 


Chlamydia trachomatis 
Chlamydiales








+ 




+ 


+ 




(weak) 


Rickettsia prowazekii 
Proteobacteria 






+ 


+ 


+ 








(weak) 


Helicobacter pylori 
Proteobacteria 






+ 




+ 


+ 


+


+ 


(weak) 


Thermotoga maritima
Thermotogales 








+ 




+ 




+ 


(weak) 



The goal of this invention is to learn how to form a functional three
component polymerase from an organism that is highly divergent from E. coli, and
whether it is as rapid and processive as the E. coli Pol III holoenzyme; namely, from
bacteria lacking χ, or ε, or having a widely divergent δ subunit, or having only one
DnaX product, or having an α subunit that encompasses both α and ε activities. All
eubacteria for which the entire genome has been sequenced have at least one of these
differences from E. coli. Many Gram negative bacteria have one or more of these
differences (e.g., Haemophilus influenzae and Aquifex aeolicus). Bacteria of the
Gram positive class have all of these different features. Because of the distant
evolutionary split between Gram positive and Gram negative bacteria, their
mechanisms of replication may have diverged significantly as well. Indeed,
purification of the replication polymerase from B. subtilis, a Gram positive cell, gives
only a single subunit polymerase (Barnes et al., "Purification of DNA Polymerase III
of Gram-Positive Bacteria," Methods in Enzymol., 262:35-42 (1995); Barnes et al.,
"Antibody to B. subtilis DNA Polymerase III: Use in Enzyme Purification and
Examination of Homology Among Replication-specific DNA Polymerases," Nucl.
Acids Res., 6:1203-209 (1979); Barnes et al., "DNA Polymerase III of Mycoplasma
pulmonis: Isolation and Characterization of the Enzyme and its Structural Gene,
polC," Mol. Microb., 13:843-854 (1994); Low et al., "Purification and
Characterization of DNA Polymerase III from Bacillus subtilis," J. Biol. Chem.,
251:1311-1325 (1976)) instead of a 10 subunit assembly containing the three
components of a rapidly processive machine as discussed above for Pol III
holoenzyme from E. coli. This finding suggests a different structural organization of
the replicase and possibly different functional characteristics as well.

Although there are many studies of replication mechanisms in
eukaryotes and, specifically, the Gram negative bacterium E. coli and its
bacteriophages, there is very little information about how Gram positive organisms
replicate. The Gram positive class of bacteria includes some of the worst human
pathogens, such as Staphylococcus aureus, Streptococcus pneumoniae, Streptococcus
pyogenes, Enterococcus faecalis, and Mycobacterium tuberculosis (Youmans et al.,
The Biological and Clinical Basis of Infectious Disease (1985)). Until this invention,
the best characterized Gram positive organism for chromosomal DNA synthesis was
Bacillus subtilis. Fractionation of B. subtilis has identified three DNA polymerases
(Gass et al., "Further Genetic and Enzymological Characterization of the Three
Bacillus subtilis Deoxyribonucleic Acid Polymerases," J. Biol. Chem., 248:7688-7700
(1973); Ganesan et al., "DNA Replication in a Polymerase I Deficient Mutant and the
Identification of DNA Polymerases II and III in Bacillus subtilis," Biochem. Biophys.
Res. Commun., 50:155-163 (1973)). These polymerases are thought to be analogous
to the three DNA polymerases of E. coli (DNA polymerases I, II, and III). Studies in
B. subtilis have identified a polymerase that appears to be involved in chromosome
replication and is termed Pol III (Ott et al., "Cloning and Characterization of the polC
Region of Bacillus subtilis," J. Bacteriol., 165:951-957 (1986); Barnes et al.,
"Localization of the Exonuclease and Polymerase Domains of Bacillus subtilis DNA
Polymerase III," Gene, 111:43-49 (1992); Barnes et al., "The 3'-5' Exonuclease Site
of DNA Polymerase III From Gram-positive Bacteria: Definition of a Novel Motif
Structure," Gene, 165:45-50 (1995) or Barnes et al., "Purification of DNA
Polymerase III of Gram-positive Bacteria," Methods in Enzymol., 262:35-42 (1995)). The
B. subtilis Pol III (encoded by polC) is larger (about 165 kDa) than the E. coli alpha
subunit (about 129 kDa) and exhibits 3'-5' exonuclease activity. The polC gene
encoding this Pol III shows weak homology to the genes encoding E. coli alpha and
the E. coli epsilon subunit. Hence, this long form of the B. subtilis Pol III (herein
referred to as α-large or Pol III-L) essentially comprises both the alpha and epsilon
subunits of the E. coli core polymerase. The S. aureus α-large has also been
sequenced, expressed in E. coli, and purified; it contains DNA polymerase and 3'-5'
exonuclease activity (Pacitti et al., "Characterization and Overexpression of the Gene
Encoding Staphylococcus aureus DNA Polymerase III," Gene, 165:51-56 (1995)).
Although α-large is essential to cell growth (Clements et al., "Inhibition of Bacillus
subtilis Deoxyribonucleic Acid Polymerase III by Phenylhydrazinopyrimidines:
Demonstration of a Drug-induced Deoxyribonucleic Acid-Enzyme Complex," J. Biol.
Chem., 250:522-526 (1975); Cozzarelli et al., "Mutational Alteration of Bacillus
subtilis DNA Polymerase III to Hydroxyphenylazopyrimidine Resistance: Polymerase
III is Necessary for DNA Replication," Biochem. And Biophy. Res. Commun.,
51:151-157 (1973); Low et al., "Mechanism of Inhibition of Bacillus subtilis DNA
Polymerase III by the Arylhydrazinopyrimidine Antimicrobial Agents," Proc. Natl.
Acad. Sci. USA, 71:2973-2977 (1974)), there could still be another DNA
polymerase(s) that is essential to the cell, such as occurs in yeast (Morrison et al., "A
Third Essential DNA Polymerase in S. cerevisiae," Cell, 62:1143-1151 (1990)).

Purification of α-large from B. subtilis results in only this single
protein without associated proteins (Barnes et al., "Localization of the Exonuclease
and Polymerase Domains of Bacillus subtilis DNA Polymerase III," Gene, 111:43-49
(1992); Barnes et al., "The 3'-5' Exonuclease Site of DNA Polymerase III From
Gram-positive Bacteria: Definition of a Novel Motif Structure," Gene, 165:45-50
(1995) or Barnes et al., "Purification of DNA Polymerase III of Gram-positive
Bacteria," Methods in Enzymol., 262:35-42 (1995)). Hence, it is possible that α-large
is a member of the Type I replicase (like T5) in which it is processive on its own
without accessory proteins. B. subtilis and S. aureus also have a gene encoding a
protein that has approximately 30% homology to the beta subunit of E. coli; however,
the protein product has not been purified or characterized (Alonso et al., "Nucleotide
Sequence of the recF Gene Cluster From Staphylococcus aureus and
Complementation Analysis in Bacillus subtilis recF Mutants," Mol. Gen. Genet.,
246:680-686 (1995); Alonso et al., "Nucleotide Sequence of the recF Gene Cluster
From Staphylococcus aureus and Complementation Analysis in Bacillus subtilis recF
Mutants," Mol. Gen. Genet., 248:635-636 (1995)). Whether this beta subunit has a
function in replication, a ring shape, or functions as a sliding clamp was not known
until recently. It was also not known whether it is functional with α-large. Recently,
it was shown that S. aureus β is functional as a ring, and that it also functions with α-
large (U.S. Patent Application Serial No. 09/235,245, which is hereby incorporated by
reference). Further, a fourth DNA polymerase was identified with greater homology
to E. coli α than α-large. This polymerase, called herein α-small, is shorter than α-
large and lacks the domain homologous to epsilon. This polymerase also functions
with the β ring, indicating that it may participate in chromosome replication. Indeed, a
recent report indicates that α-small is essential for replication in Streptomyces
coelicolor A3(2) (Flett et al., "A Gram-negative type DNA Polymerase III is Essential
for Replication of the Linear Chromosome of Streptomyces Coelicolor A3(2)," Mol.
Micro., 31:949-958 (1999)).

As described earlier, purification of the replicase from the Gram
positive B. subtilis gives only a single subunit Pol III, instead of a multicomponent
complex. Also, S. aureus dnaX has been shown to encode only one subunit (U.S.
Patent Application Serial No. 09/235,245, which is hereby incorporated by reference).
Moreover, S. aureus and B. subtilis lack homologues to χ, ψ, and θ, and the δ subunit is
only weakly homologous to δ of E. coli (only 28%). Further, they lack a homologue to
dnaQ encoding ε. Instead, they contain this activity (3'-5' exonuclease) in the polC
gene product which provides the α-large form of α. The ε subunit is needed for high
speed and processivity of the E. coli Pol III holoenzyme; the α subunit alone is much
less rapid and processive with the β ring compared to the presence of both
α and ε (Studwell et al., "Processive Replication is Contingent on the Exonuclease
Subunit of DNA Polymerase III Holoenzyme," J. Biol. Chem., 265:1171-1178 (1990)).

Studies using the E. coli β ring (and γ complex) show they confer onto
S. aureus α quite efficient synthesis (U.S. Patent Application Serial No. 09/235,245,
which is hereby incorporated by reference), but the efficiency is not equal to that of E.
coli αε with β (and γ complex). This may be due to use of the heterologous
combination of an α subunit from one organism (S. aureus) with the β clamp from
another (E. coli). However, it is also possible that S. aureus α simply does not
function with a β clamp to produce speed and processivity comparable to the E. coli
polymerase. Also, as described earlier, the α-large subunit of B. subtilis purifies as a
single subunit, rather than associated with accessory subunits assembled into the three
components of a rapid, processive machine (i.e., like E. coli Pol III holoenzyme). The
lack of two DnaX products, lack of a multicomponent structure, and lack of gene
homologues encoding several subunits of the three component Pol III of E. coli bring
into question whether other types of bacteria, such as Gram positive cells, even have an
enzyme with similar structure or comparable speed and processivity to that found in
the Gram negative E. coli.

The lack of gene homologues encoding several subunits of the E. coli
three component polymerase creates uncertainties with respect to reconstructing a
rapid and processive polymerase from a Gram positive cell that has characteristics like
the Pol III system of E. coli.

The γ and δ' proteins are homologous to one another and are C-shaped
proteins (Dong et al., "DNA Polymerase III Accessory Proteins," J. Biol. Chem.,
268:11758-11765 (1993); Guenther et al., "Crystal Structure of the δ' Subunit of the
Clamp-loader Complex of E. coli DNA Polymerase III," Cell, 91:335-345 (1997)).
The clamp loaders of yeast and humans are composed of five proteins, all of which are
homologous to one another and to γ and δ' (Cullman et al., "Characterization of the
Five Replication Factor C Genes of Saccharomyces cerevisiae," Mol. Cell. Biol.,
15:4661-4671 (1995)). This provides evidence that a clamp loader can be composed
entirely of C-shaped proteins. Perhaps the Gram positive DnaX-protein (hereafter
referred to as τ) and δ' are sufficient to provide function as a clamp loader. Indeed, the
clamp loader of T4 phage is composed of only two different proteins, gp44/62
complex (Young et al., "Structure and Function of the Bacteriophage T4 DNA
Polymerase Holoenzyme," Biochem., 31:8675-8690 (1992)). This idea is also
supported by the presence of only two RFC genes in archaebacteria, suggesting that
they may utilize two C-shaped proteins for clamp loading, in contrast to yeast and
humans that use five. With this consideration in mind, genes were identified and
isolated and the τ protein (encoded by dnaX) and δ' (encoded by holB) of another
Gram positive organism, Streptococcus pyogenes, were expressed and purified. As
was observed in S. aureus, S. pyogenes dnaX produces only a single polypeptide. The
β, encoded by dnaN of S. pyogenes, was also identified, expressed, and purified, as
were the α-large subunit encoded by polC and the SSB encoded by the ssb gene.
These proteins were studied for interactions and characterized for their effect on
α-large. However, the hypothesis was incorrect as τ and δ' did not form a τδ' complex,
nor did they assemble β onto DNA or provide stimulation of α when using β on
primed and SSB coated M13mp18 ssDNA.

In light of the inability of S. pyogenes τ protein and δ' to function as a
clamp loader, it seemed reasonable to expect that one or more other proteins are
needed. The fact that E. coli has some replicase subunits that other bacteria do not
suggests that other bacteria may have some replicase subunits that E. coli does not.
Indeed, genetic studies of Bacillus subtilis demonstrate that it has three genes needed
for replication that E. coli does not have. Two of these novel genes, called dnaB (not
the same as E. coli dnaB encoding the helicase) and dnaD, have no significant
homology to genes in the E. coli genome database (Bruand et al., "Nucleotide
Sequence of the Bacillus subtilis dnaD gene," Microbiol., 141:321-322 (1995);
Hoshino et al., "Nucleotide Sequence of Bacillus subtilis dnaB: A gene Essential for
DNA replication Initiation and Membrane Attachment," Proc. Natl. Acad. Sci. USA,
84:653-657 (1987)). Further, dnaI of B. subtilis is important for replication and has,
at best, a very limited homology to E. coli dnaC (Karamata et al., "Isolation and
Genetic Analysis of Temperature-Sensitive Mutants of B. subtilis Defective in DNA
synthesis," Molec. Gen. Genetics, 108:277-287 (1970)). Perhaps one or more of these
genes encode the protein(s) necessary to provide clamp loading activity when
combined with τ and δ', or to couple with α to provide it with speed and/or
processivity as the E. coli epsilon does. The S. pyogenes homologues of B. subtilis
dnaI, dnaD, and dnaB were identified and cloned, and the encoded proteins were
expressed and purified. However, these proteins failed to provide activity alone or in
combinations with S. pyogenes τ and δ' in loading S. pyogenes β onto DNA, or in
stimulating S. pyogenes α-large in combination with β, τ, and δ' on SSB coated
primed M13mp18 ssDNA.

Weak homology exists for the holA gene among prokaryotes. This
weak homologue of holA was identified in S. pyogenes and, then, it was cloned,
expressed, and the putative δ was purified. The putative δ formed an isolatable
complex with τ and δ'. In fact, the τδδ' complex loaded S. pyogenes β onto DNA, and
it stimulated S. pyogenes α-large in a β dependent reaction on primed SSB coated
M13mp18 ssDNA. Hence, this protein was the only missing component necessary to
provide clamp loading activity. Further, a mixture of α with τδδ', followed by ion
exchange chromatography on MonoQ, indicated formation of an ατδδ' complex.
Consistent with this, τ appeared to bind α in gel filtration analysis.

Whether the S. pyogenes three component polymerase can synthesize
DNA in as rapid and processive a fashion as the E. coli Pol III holoenzyme three
component polymerase is very difficult to predict, because no other DNA polymerase
known to date catalyzes synthesis at the rate or processivity of the E. coli three
component polymerase. For example, the three component T4 phage polymerase
travels about 400 nucleotides/s, the yeast DNA polymerase delta three component
polymerase travels about 120 nucleotides/s, and the human DNA polymerase delta
three component enzyme appears slower and less processive than the yeast enzyme.

The standard test for these speed and processivity characteristics is
examination of a time course in extension of a primer on a very long template, such as
around the 7.2 kb M13mp18 ssDNA genome coated with SSB and primed with a
synthetic DNA oligonucleotide. The results of experiments of this type demonstrate
that the three component S. pyogenes polymerase is indeed extremely rapid in
synthesis. Surprisingly, it is just as fast as the E. coli enzyme. Extension proceeds at
about 700-800 nucleotides per second, completing the entire template in about 9
seconds. The enzyme was fully processive throughout replication of the M13mp18
genome, as could be determined from the fact that some templates were not extended
at all, while others were extended to completion. If the enzyme had not been
processive during the entire replication reaction, then when it came off one partially
extended DNA genome it would have reassociated with the unextended DNA that
remained and partially replicated it as well (and so on until the entire population of
DNA became fully replicated). This did not happen. Instead, the reaction showed a
mixture of completely replicated templates and templates that were still untouched
starting material. This indicates that the enzyme stays with a template until it
completes it before it cycles over to replicate another one (i.e., it is highly
processive). Each of the five proteins, α, τ, δ, δ' and β, is needed to obtain this rapid
and processive DNA synthesis.
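
The relationship between the quoted extension rates and the time needed to
copy the template can be checked with simple arithmetic. The sketch below is
illustrative only: it assumes a template of about 7,200 nucleotides (the 7.2 kb
M13mp18 figure given above), a single fully processive polymerase per template,
and a constant rate; the function and variable names are hypothetical.

```python
# Illustrative arithmetic only. Rates and template length are the figures
# quoted in the text; a constant rate and full processivity are assumed.
TEMPLATE_NT = 7200  # approximate length of M13mp18 ssDNA (about 7.2 kb)


def seconds_to_complete(rate_nt_per_s, template_nt=TEMPLATE_NT):
    """Time for one fully processive polymerase to copy the whole template."""
    if rate_nt_per_s <= 0:
        raise ValueError("rate must be positive")
    return template_nt / rate_nt_per_s


rates_nt_per_s = {
    "S. pyogenes, lower quoted rate": 700,
    "S. pyogenes, upper quoted rate": 800,
    "T4 phage": 400,
    "yeast Pol delta": 120,
}

for name, rate in rates_nt_per_s.items():
    print(f"{name}: {seconds_to_complete(rate):.1f} s")
```

Under these assumptions the upper quoted rate corresponds to the "about 9
seconds" stated above, while the slower enzymes would need correspondingly
longer for the same template.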

This invention has provided an intellectual template by which the
clamp loader component of these three component polymerases can be obtained from
any eubacterial prokaryotic cell and how to use it with the other components to
produce a rapid and processive polymerase. All prokaryotes in the eubacterial
kingdom that have been sequenced to date contain homologues of these genes. As the
process of lateral gene transfer appears to be a major force in evolution, it would
appear that relatedness of enzymes and enzyme machines is best judged by
comparisons of their genes and proteins rather than by phylogeny of which bacteria
they are in (Doolittle et al., "Archaeal Genomics: Do Archaea have a Mixed
Heritage?," Curr. Biol., 8:R209-R211 (1998)). As pointed out earlier in this
application, most bacteria have genetic characteristics of replication genes/proteins of
S. pyogenes rather than that of E. coli (i.e., no genes encoding χ, ψ, or θ, only a weak
homolog to δ, or a dnaX gene encoding only a single protein).

The dnaX gene encoding τ and γ in E. coli encodes only one protein in
some organisms, but, as this application shows, it is still functional in forming a
protein complex capable of rapid and processive DNA synthesis. In addition, this
application shows that the delta subunit, which is only weakly homologous among
different prokaryotic organisms, is an essential functional subunit of the three
component polymerase (instead of having diverged so as to fulfill an entirely different
function in some other intracellular process). As mentioned earlier, several genes
encoding subunits of the E. coli clamp loader (γ complex; γδδ'χψ) are not obviously
present in other prokaryotes (holC and holD encoding χ and ψ). Hence, one may
anticipate that other genes may have evolved to encode new subunits that replace
these, and that these new subunits may have been essential to the activity of the clamp
loader. For example, they may have either taken over some of the functionality of
another subunit, or be needed structurally (e.g., the physical presence of a subunit
could be needed for one subunit to assume its proper and active conformation, or for
one or more of the subunits to form a complex together to yield the multisubunit clamp
loader assembly). In addition, this application shows that the α subunit (polC gene
product) is sufficient for rapid and processive synthesis with the other two
components (i.e., E. coli requires the ε subunit to bind to α for rapid and processive
synthesis of α with the β clamp). Finally, this application shows that the S. pyogenes
three component polymerase synthesizes DNA as fast as the E. coli Pol III three
component polymerase. Up to this point, the E. coli Pol III three component
polymerase was over twice the speed of the T4 enzyme and over 5 times the speed of
others. Hence, it was possible that E. coli may have been unique among prokaryotes
in having a polymerase that achieves such speed. This invention shows that this is not
the case. Instead, this speed in polymerization generalizes to the Gram positive
prokaryotic three component DNA polymerases. It may be presumed, now that two
examples of three component polymerases in widely divergent bacteria share the
characteristics of rapid, processive synthesis, that the three component polymerase of
other eubacteria will also be rapid and processive.

These rapid and processive three component DNA polymerases can be
applied to several important uses. DNA sequencing and DNA amplification
currently use enzymes that are much slower and thus could
be improved upon. This is especially true of amplification, as the three component
polymerase is capable of speed and high processivity, making possible amplification of
very long (tens of kb to Mb) lengths of DNA in a time efficient manner. These three
component polymerases also function in conjunction with a replicative helicase
(DnaB) and, thus, are capable of amplification at ambient temperature using the
helicase to melt the DNA duplex. This property could be useful in amplification
reaction procedures such as in polymerase chain reaction (PCR) methodology.
Finally, these three component polymerases and their associated helicase (DnaB) and
primase (DnaG) are attractive targets for antibiotics due to their essential and central
role in cell viability.

This application provides a three component polymerase from two
human pathogens in the Gram positive class. It makes possible the production of this
three component polymerase from other bacteria of the Gram positive type (e.g.,
Streptococci, Staphylococci, Mycoplasma) and other types of bacteria
lacking χ, ψ, or θ, those having only one protein produced by their dnaX gene such as
obligate intracellular parasites, Mycoplasmas (possibly evolved from Gram positives),
Cyanobacteria (Synechocystis), Spirochaetes such as Borrelia and Treponema and
Chlamydia, and distant relatives of E. coli in the Gram negative class (e.g., Rickettsia
and Helicobacter). These three component polymerases are useful in manipulation of
nucleic acids for research and diagnostic purposes (e.g., sequencing and amplification
methods) and for screening chemicals for antibiotic activity (useful in human or
animal therapy and agriculture such as animal feed supplements). There are several
assays described previously in U.S. Patent Application Serial No. 09/235,245 to
O'Donnell et al., which is hereby incorporated by reference, that use these three
component polymerases (or subassemblies), as well as the DnaB and DnaG
homologues, either alone or in various combinations, for the purpose of screening
chemicals, such as chemical libraries, for inhibitor activity. Such inhibitors can be
developed further (usually by chemical manipulation and alteration) into lead
compounds and then into full-fledged pharmaceuticals.

There remains a need to understand the molecular details of the process
of DNA replication in other cells that are quite different from E. coli, such as in Gram
positive cells. It is possible that a more detailed understanding of replication proteins
will lead to discovery of new antibiotics. Therefore, a deeper understanding of
replication proteins of Gram positive bacteria is especially important given the
emergence of drug resistant strains of these organisms. For example, Staphylococcus
aureus has successfully mutated to become resistant to all common antibiotics.

The "target" protein(s) of an antibiotic drug is generally involved in a
critical cell function, such that blocking its action with a drug causes the pathogenic
cell to die or no longer proliferate. Current antibiotics are directed to very few targets.
These include membrane synthesis proteins (e.g., vancomycin, penicillin, and its
derivatives such as ampicillin, amoxicillin, and cephalosporin), the ribosome
machinery (e.g., tetracycline, chloramphenicol, azithromycin, and the aminoglycosides
such as kanamycin, neomycin, gentamicin, streptomycin), RNA polymerase (e.g.,
rifampicin), and DNA topoisomerases (e.g., novobiocin, quinolones, and
fluoroquinolones). The DNA replication apparatus is a crucial life process and, thus,
the proteins involved in this process are good targets for antibiotics.

A powerful approach to discovery of a new drug is to obtain a target
protein, characterize it, and develop in vitro assays of its cellular function. Large
chemical libraries can then be screened in the functional assays to identify compounds
that inhibit the target protein. These candidate pharmaceuticals can then be
chemically modified to optimize their potency, breadth of antibiotic spectrum,
non-toxicity, performance in animal models and, finally, clinical trials. The screening of
large chemical libraries requires a plentiful source of the target protein. An abundant
supply of protein generally requires overproduction techniques using the gene
encoding the protein. This is especially true for replication proteins as they are
present in low abundance in the cell.

Selective and robust assays are needed to screen reliably a large 
chemical library. The assay should be insensitive to most chemicals in the 
concentration range normally used in the drug discovery process. These assays should 
also be selective and not show inhibition by antibiotics known to target proteins in 
processes outside of replication. 

The present invention is directed to overcoming these deficiencies in
the art.

SUMMARY OF THE INVENTION 

The present invention relates to various isolated DNA molecules from
Staphylococcus aureus and Streptococcus pyogenes, both of which are Gram positive
bacteria. These include DNA molecules which include a coding region from the dnaE
gene (encoding α-small), dnaX gene (encoding tau), polC gene (encoding Pol III-L
or α-large), dnaN gene (encoding beta), holA gene (encoding delta), holB gene
(encoding delta prime), ssb gene (encoding SSB), dnaB gene (encoding DnaB), and
dnaG gene (encoding DnaG) of S. aureus and/or S. pyogenes. These DNA molecules
can be inserted into an expression system and used to transform host cells. The
isolated proteins or polypeptides encoded by these DNA molecules, and their ability to
function when used in combination, are also disclosed. The resulting actions provide
assembly of a ring onto DNA via a clamp loader, and polymerase activity dependent
on this ring that is rapid and processive.

A further aspect of the present invention relates to a method of
identifying compounds which inhibit activity of a polymerase product of polC or
dnaE. This method is carried out by forming a reaction mixture comprising a primed
DNA molecule, a polymerase product of polC or dnaE, a candidate compound, a
dNTP, and optionally either a beta subunit, a tau complex, or both the beta subunit
and the tau complex, wherein at least one of the polymerase product of polC or dnaE,
the beta subunit, the tau complex, or a subunit or combination of subunits thereof is
derived from a Eubacteria other than Escherichia coli; subjecting the reaction mixture
to conditions effective to achieve nucleic acid polymerization in the absence of the
candidate compound; analyzing the reaction mixture for the presence or absence of
nucleic acid polymerization extension products; and identifying the candidate
compound in the reaction mixture where there is an absence of nucleic acid
polymerization extension products.
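
The final analysis step of such a screen, in which a candidate is identified
when extension products are absent, can be sketched as a simple comparison
against a reaction run without any candidate compound. The sketch below is a
hypothetical illustration, not the procedure of this application: the function
name, the compound labels, the signal values, and the 10% cutoff used to
operationalize "absence" are all assumptions made for the example.

```python
# Hypothetical sketch of the read-out step of an inhibitor screen.
# "Absence" of extension products is approximated here by a signal at or
# below a chosen fraction of the no-compound control; the 10% default
# cutoff is an assumption, not a value taken from the text.


def flag_inhibitors(control_signal, candidate_signals, cutoff_fraction=0.1):
    """Return names of candidates whose extension signal is at or below
    cutoff_fraction of the no-compound control signal."""
    if control_signal <= 0:
        raise ValueError(
            "control shows no polymerization; the assay is not interpretable"
        )
    threshold = cutoff_fraction * control_signal
    return sorted(
        name
        for name, signal in candidate_signals.items()
        if signal <= threshold
    )


# Made-up example values (arbitrary units of incorporated nucleotide).
example_signals = {"cmpd_A": 28.5, "cmpd_B": 1.2, "cmpd_C": 0.0}
print(flag_inhibitors(30.0, example_signals))
```

Requiring a positive control signal reflects the condition stated above that
the reaction conditions must support polymerization in the absence of the
candidate compound; without it, a missing product says nothing about inhibition.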

The present invention deciphers the structure and mechanism of the
chromosomal replicase of Gram positive bacteria and other bacteria lacking holC,
holD, holE or dnaQ genes, or having a dnaX gene that encodes only one protein.
Rather than use a DNA polymerase that attains high efficiency on its own, or with one
other subunit, the Gram positive bacteria replicase is a three component type of
replicase (class III) that uses a sliding clamp protein. The Gram positive bacteria
replicase also uses a clamp loader component that assembles the sliding clamp onto
DNA. This knowledge, and the enzymes involved in the replication process, can be
used for the purpose of screening for potential antibiotic drugs. Further, information
about chromosomal replicases may be useful in DNA sequencing, DNA amplification,
polymerase chain reaction, and other DNA polymerase related techniques.

The present invention identifies two DNA polymerases (both of Pol III
type) in Gram positive bacteria that utilize the sliding clamp and clamp loader. The
present invention also identifies a gene with homology to the alpha subunit of E. coli
DNA polymerase III holoenzyme, the chromosomal replicase of E. coli. These DNA
polymerases can extend a primer around a large circular natural template when the
beta clamp has been assembled onto the primed ssDNA by the clamp loader, or extend a
primer on a linear DNA where the beta clamp may assemble by itself by sliding over
an end.

The present invention shows that the clamp and clamp loader
components of Gram negative cells can be exchanged for those of Gram positive cells
in that the clamp, once assembled onto DNA, will function with Pol III obtained from
either Gram positive or Gram negative sources. This result implies that important
contacts between the polymerase and clamp have been conserved during evolution.
Therefore, these "mixed systems" may provide assays for an inhibitor of this
conserved interaction. Such an inhibitor may be expected to shut down replication,
and since the interaction is apparently conserved across the evolutionary spectrum
from Gram positive to Gram negative cells, the inhibitor may exhibit a broad
spectrum of antibiotic activity.

The present invention demonstrates that Gram positive bacteria contain
a beta subunit that behaves as a sliding clamp that encircles DNA. A dnaX gene
sequence encoding a protein homolog of the gamma/tau subunit of the clamp loader
(gamma/tau complex) of E. coli DNA polymerase III holoenzyme is also identified. The
presence of this gene confirms the presence of a clamp loading apparatus in Gram
positive bacteria that will assemble beta clamps onto DNA for the DNA polymerases.

This application also outlines methods and assays for use of these 
replication proteins in drug screening processes. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the construction of the S. aureus Pol III-L expression
vector. The gene encoding Pol III-L was cloned into a pET11 expression vector in a
three step cloning scheme as illustrated.

Figures 2A-C describe the expression and purification of S. aureus Pol
III-L (alpha-large). Figure 2A compares E. coli cells that contain the pET11PolC
expression vector that are either induced or uninduced for protein expression. The gel
is stained with Coomassie Blue. The induced band corresponds to the expected
molecular weight of the S. aureus Pol III-L, and is indicated to the right of the gel.
Figure 2B shows the results of the MonoQ chromatography of a lysate of E. coli
(pET11PolC-L) induced for Pol III-L. The fractions were analyzed in a Coomassie
Blue stained gel (top) and for DNA synthesis (bottom). Fractions containing Pol III-L
are indicated. In Figure 2C, fractions containing Pol III-L from the MonoQ column
were pooled and chromatographed on a phosphocellulose column. This shows an
analysis of the column fractions from the phosphocellulose column in a Coomassie
Blue stained polyacrylamide gel. The position of Pol III-L is indicated to the right.

Figure 3 shows the S. aureus beta expression vector. The dnaN gene
was amplified from S. aureus genomic DNA and cloned into the pET16 expression
vector.

Figures 4A-C illustrate the expression and purification of S. aureus
beta. Figure 4A compares E. coli cells that contain the pET16beta expression vector
that are either induced or uninduced for protein expression. The gel is stained with
Coomassie Blue. The induced band corresponds to the expected molecular weight of
the S. aureus beta, and is indicated to the right of the gel. The migration positions of
size standards are indicated to the left. Figure 4B shows the results of MonoQ
chromatography of an E. coli (pET16beta) lysate induced for beta. The fractions were
analyzed in a Coomassie Blue stained gel, and fractions containing beta are indicated.
In Figure 4C, fractions containing beta from the MonoQ column were pooled and
chromatographed on a phosphocellulose column. This shows an analysis of the
column fractions from the phosphocellulose column in a Coomassie Blue stained
polyacrylamide gel. The position of beta is indicated to the right.

Figures 5A-B demonstrate that the S. aureus beta stimulates S. aureus
Pol III-L and E. coli Pol III core on linear DNA, but not circular DNA. In Figure 5A,
the indicated proteins were added to replication reactions containing polydA-oligodT
as described in the Examples infra. Amounts of proteins added, when present, were:
Lanes 1,2: S. aureus Pol III-L, 7.5 ng; S. aureus beta, 6.2 µg; Lanes 3,4: E. coli Pol III
core, 45 ng; S. aureus beta, 9.3 µg; Lanes 5,6: E. coli Pol III core, 45 ng; E. coli beta,
5 µg. Total DNA synthesis was: Lanes 1-6: 4.4, 30.3, 5.1, 35.5, 0.97, 28.1 pmol,
respectively. In Figure 5B, Lanes 1-3, the indicated proteins were added to replication
reactions containing circular singly primed M13mp18 ssDNA as described in the
Examples infra. S. aureus beta, 0.8 µg; S. aureus Pol III-L, 300 ng (purified through
MonoQ); E. coli clamp loader complex, 1.7 µg. Results in the E. coli system are
shown in Lanes 4-6. Total DNA synthesis was: Lanes 1-6: 0.6, 0.36, 0.99, 2.7, 3.5,
280 pmol, respectively.

Figure 6 shows that S. aureus Pol III-L functions with E. coli beta and
clamp loader complex on circular primed DNA. It also shows that S. aureus beta does
not convert Pol III-L with sufficient processivity to extend the primer all the way
around a circular DNA. Replication reactions were performed on the circular singly
primed M13mp18 ssDNA. Proteins added to the assay are as indicated in this figure.
The amount of each protein, when present, is: S. aureus beta, 800 ng; S. aureus Pol
III-L, 1500 ng (MonoQ fraction 64); E. coli Pol III core, 450 ng; E. coli beta, 100 ng;
E. coli gamma complex, 1720 ng. Total DNA synthesis in each assay is indicated at
the bottom of the figure.

Figures 7A-B show that S. aureus contains four distinct DNA 
polymerases. Four different DNA polymerases were partially purified from S. aureus 
cells. S. aureus cell lysate was separated from DNA and, then, chromatographed on a 
MonoQ column. Fractions were analyzed for DNA polymerase activity. Three peaks 
of activity were observed. The second peak was the largest and was expected to be a 
mixture of two DNA polymerases based on early studies in B. subtilis. 
Chromatography of the second peak on phosphocellulose (Figure 7B) resolved two 
DNA polymerases from one another. 

Figures 8A-B show that S. aureus has two DNA Pol III's. The four
DNA polymerases partially purified from S. aureus extract, designated peaks I-IV in
Figure 7, were assayed on circular singly primed M13mp18 ssDNA coated with E.
coli SSB either in the presence or absence of E. coli beta (50 ng) and clamp loader
complex (50 ng). Each reaction contained 2 µl of the partially pure polymerase (Peak
1 was MonoQ fraction 24 (1.4 µg), Peak 2 was phosphocellulose fraction 26 (0.016
mg/ml), Peak 3 was phosphocellulose fraction 46 (0.18 mg/ml), and Peak 4 was
MonoQ fraction 50 (1 µg)). Figure 8A shows the product analysis in an agarose gel.
Figure 8B shows the extent of DNA synthesis in each assay.

Figure 9 compares the homology between the polypeptide encoded by
dnaE of S. aureus and other organisms. An alignment is shown for the amino acid
sequence of the S. aureus dnaE product with the dnaE products (alpha subunits) of E.
coli and Salmonella typhimurium.

Figure 10 compares the homology between the N-terminal regions of
the gamma/tau polypeptides of S. aureus, B. subtilis, and E. coli. The conserved ATP
site and the cysteines forming the zinc finger are indicated above the sequence. The
organisms used in the alignment were: E. coli (GenBank); and B. subtilis.

Figure 11 compares the homology between the DnaB polypeptide of
S. aureus and other organisms. The organisms used in the alignment were: E. coli
(GenBank); B. subtilis; Sal.Typ. (Salmonella typhimurium).

Figures 12A-B show the alignment of the delta subunit encoded by
holA for E. coli and B. subtilis (Figure 12A) and for the delta subunit of B. subtilis and
S. pyogenes (Figure 12B). Figure 12A shows ClustalW generated alignment of S.
pyogenes (Gram positive) delta to E. coli (Gram negative) delta. Figure 12B shows
ClustalW generated alignment of B. subtilis (Gram positive) delta to S. pyogenes
(Gram positive) delta.

Figure 13 is an image of an autoradiograph of an agarose gel analysis
of replication products from singly primed, SSB coated M13mp18 ssDNA using the
reconstituted S. aureus Pol III holoenzyme. Only in the presence of the τδδ' complex
does α-large (PolC) function with β to replicate a full circular duplex DNA (RFII).

Figure 14 shows a Coomassie Blue stained SDS polyacrylamide gel of
the pure S. pyogenes subunits corresponding to alpha-large, alpha-small, dnaX gene
product (called tau), beta, delta, delta prime, and SSB. The first lane shows the
position of molecular weight markers. Purified proteins were separated on a 15%
SDS-PAGE and stained with Coomassie Brilliant Blue R-250. Each lane contains 5
micrograms of protein. Lane 1, markers; lane 2, alpha-large; lane 3, alpha-small;
lane 4, tau subunit; lane 5, beta subunit; lane 6, delta subunit; lane 7, delta prime
subunit; lane 8, single strand DNA binding protein.

Figures 15A-C document the ability to reconstitute the τδδ' complex of
S. pyogenes. Proteins were mixed and gel filtered on Superose 6, followed by analysis
of the column fractions in a SDS polyacrylamide gel. Figure 15A shows a mixture of
τδδ'. Figure 15B shows a mixture of τδ. Figure 15C shows a mixture of τδ'.

Figures 16A-E show that the S. pyogenes τδδ' complex can load the S. 
pyogenes beta clamp onto (circular) DNA. Loading reactions contained 500 fmol 
nicked pBSK plasmid, 500 fmol of either τδδ' complex, tau, delta, or delta prime, 
1 pmol 32P-labelled beta dimer, 8 mM MgCl2, and 1 mM ATP. Reaction components were 
preincubated for 10 min at 37°C prior to loading onto a 5 ml Biogel A15M column 
equilibrated with buffer A containing 100 mM NaCl. Figure 16A demonstrates the 
ability of the τδδ' complex to load the beta dimer onto a nicked pBSK circular plasmid. 
Figures 16B-E show the results of using either: beta alone (Figure 16B); δδ' plus β 
(Figure 16C); τ, δ and β (Figure 16D); or τ, δ' and β (Figure 16E). 

Figures 17A-C show that τ and alpha interact. Figure 17A shows the 
result of gel filtration analysis of a mixture of τ with alpha-large. Gel filtration 
fractions were analyzed in an SDS polyacrylamide gel. Figures 17B and 17C show the 
results using only τ or only alpha-large, respectively. Comparison of the elution 
positions of the proteins shows that the positions of alpha and tau are shifted toward a 
higher molecular weight complex when they are present together. The fact that they do 
not exactly comigrate may indicate that they are initially together in a complex, but 
that the complex dissociates during the time of the gel filtration experiment (over one 
half hour). 

Figures 18A-B document the ability to reconstitute an αLτδδ' (Pol III*) 
complex of S. pyogenes. Proteins were mixed, preincubated for 20 min at 15°C, and gel 
filtered on Superose 6, followed by analysis of the column fractions in an SDS 
polyacrylamide gel (Figure 18A). Proteins were loaded on a MonoQ column, then 
eluted with a linear gradient of 50-500 mM NaCl, followed by analysis of the column 
fractions in an SDS polyacrylamide gel (Figure 18B). The αLτδδ' complex migrates 
early. 

Figure 19 illustrates the speed and processivity of the S. pyogenes 
αLτδδ' (Pol III*) complex. The αLτδδ' (Pol III*) complex was incubated with primed 
M13mp18 ssDNA (coated with S. pyogenes SSB) and only two dNTPs, then 
replication was initiated upon adding the remaining two dNTPs. Reactions contained 
25 fmol singly primed M13mp18 ssDNA template, 300 fmol β2, and either 75 fmol or 
250 fmol αLτδδ'. Time points were quenched with SDS/EDTA then analyzed in a 
neutral agarose gel followed by autoradiography. Each time point is a separate 
reaction. The time course of polymerization was performed at two different ratios of 
polymerase/primed template to assess speed and processivity of nucleotide 
incorporation. 

Figures 20A-I show the extent of homology between S. pyogenes 
replication genes and those of other organisms. Due to the low homology of delta 
(Figure 20D), one must "walk" from one organism to the next in order to recognize 
the homologue with high probability. Percent identity over regions of the indicated 
number of amino acid residues is shown for each match (i.e., the two organisms at the 
opposite ends of each line). Amino acid sequences were retrieved from either 
GenBank or individual unfinished genome databases. 
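
As an illustration only, a percent identity value of the kind reported in Figures 20A-I can be computed from a pair of aligned sequences along the following lines. This is a minimal sketch and not the procedure used to prepare the figures: the function name percent_identity, the convention of skipping columns that contain a gap, and the two short toy sequences are all assumptions introduced here.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity over aligned columns in which neither sequence has a gap.

    Assumption: gapped columns are excluded from the denominator. Other
    conventions (e.g. dividing by the full alignment length) give other values.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    identical = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns where either sequence is gapped
        compared += 1
        if a == b:
            identical += 1
    return 100.0 * identical / compared if compared else 0.0


# Hypothetical toy alignment (one-letter amino acid codes), not taken from the figures.
print(percent_identity("MKT-AY", "MRTLAY"))  # 4 identical of 5 compared columns
```

Reporting the number of compared residues next to the percentage, as the figures do, matters because a high identity over a short region is weaker evidence of homology than the same identity over a long one.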

Figures 21A-F are images illustrating that the S. pyogenes DnaE (alpha-small) 
polymerase functions with β. Figures 21A-B illustrate the relationship between 
DnaE and β for association with ssDNA. Different amounts of DnaE polymerase 
were added to an SSB coated M13mp18 ssDNA circle primed with a single DNA 
oligonucleotide, and products were analyzed in a native agarose gel. Reactions were 
performed in the presence of τδδ' and in either the absence (Figure 21C, panels 1-4) or 
presence (Figure 21D, panels 1-4) of β. Positions of completed duplex (RFII) and 
initial primed template (ssDNA) are indicated. Figure 21E shows an analysis of 
exonuclease activity by PolC and DnaE on a 5'-32P-DNA 30-mer. Aliquots were 
removed at the indicated times and analyzed in a sequencing gel. Figure 21F shows 
the effect of TMAU on PolC and DnaE in the presence of τδδ' and β. DNA products 
were analyzed in a native agarose gel. Positions of initial primed M13mp18 (ssDNA) 
and completed circular duplex (RFII) are indicated. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention relates to various isolated nucleic acid molecules 
from Gram positive bacteria and other bacteria lacking holC, holD, or holE genes or 
having a dnaX gene encoding only one subunit. These include DNA molecules which 
correspond to the coding regions of the dnaE, dnaX, holA, holB, polC, dnaN, SSB, 
dnaB, and dnaG genes. These DNA molecules can be inserted into an expression 
system or used to transform host cells. The isolated proteins or polypeptides encoded 
by these DNA molecules and their use to form a three component polymerase are also 
disclosed. Also encompassed by the present invention are corresponding RNA 
molecules transcribed from the DNA molecules. 

These DNA molecules and proteins can be derived from numerous 
bacteria, including Staphylococcus, Streptococcus, Enterococcus, Mycoplasma, 
Mycobacterium, Borrelia, Treponema, Rickettsia, Chlamydia, Helicobacter, and 
Thermotoga. The invention is particularly directed to such DNA molecules and proteins derived 
from Streptococcus and Staphylococcus bacteria, particularly Streptococcus pyogenes 
and Staphylococcus aureus (see U.S. Patent Application Serial No. 09/235,245, which 
is hereby incorporated by reference). 

The gene sequences used to obtain DNA molecules of the present 
invention were obtained by sequence comparisons with the E. coli counterparts, 
followed by detailed analysis of the raw sequence data in the contigs from the S. 
pyogenes database (http://dnal.chem.ou.edu/strep.html) or the S. aureus database 
(http://www.genome.ou.edu/staph.html) to identify the open reading frames. In many 
instances, nucleotide errors were observed causing frameshifts in the open reading 
frame thus truncating it. Therefore, upon cloning the genes via PCR, the genes were 
sequenced to obtain correct information. Also, the full nucleotide sequence of the ssb 
gene was not present in the database. This gene was cloned by circular PCR, and the full 
sequence is reported below. 
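
The way a single nucleotide error shifts the reading frame and truncates an open reading frame can be illustrated with a short sketch. It is not the analysis actually performed on the database contigs: the function count_sense_codons and both toy sequences are hypothetical and were constructed here only to show the effect.

```python
STOP_CODONS = {"taa", "tag", "tga"}


def count_sense_codons(seq: str, start: int = 0):
    """Count codons read in frame from `start` before the first stop codon.

    Returns None when no in-frame stop codon is found.
    """
    seq = seq.lower()
    for n, i in enumerate(range(start, len(seq) - 2, 3)):
        if seq[i:i + 3] in STOP_CODONS:
            return n
    return None


# Hypothetical toy gene: start codon, eight Asp codons, stop codon.
clean = "atg" + "gat" * 8 + "taa"
# The same gene with one spurious base inserted after the third codon,
# mimicking a sequencing error in a raw contig.
shifted = clean[:9] + "c" + clean[9:]

print(count_sense_codons(clean))    # full-length reading frame
print(count_sense_codons(shifted))  # frame truncated by a premature stop
```

In the toy example the inserted base brings a tga stop codon into frame, so the apparent reading frame ends early. Resequencing the PCR-cloned gene, as described above, is what resolves whether such a stop is real.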

The S. aureus dnaX and dnaE genes were identified by aligning genes 
of several organisms and designing primers for use in PCR to obtain a gene fragment, 
followed by steps to identify the entire gene. 

One aspect of the present invention relates to a newly discovered Pol 
III gene (herein identified as dnaE) of S. aureus whose encoded protein is homologous 
to E. coli alpha (product of dnaE gene). The partial nucleotide sequence of the S. 
aureus dnaE gene corresponds to SEQ. ID. No. 1 as follows: 



atggtggcat atttaaatat tcatacggct tatgatttgt taaattcaag cttaaaaata 60 
gaagatgccg taagacttgc tgtgtctgaa aatgttgatg cacttgccat aactgacacc 120 
aatgtattgt atggttttcc taaattttat gatgcatgta tagcaaataa cattaaaccg 180 
atttttggta tgacaatata tgtgacaaat ggattaaata cagtcgaaac agttgttcta 240 
gctaaaaata atgatggatt aaaagatttg tatcaactat catcggaaat aaaaatgaat 300 
gcattagaac atgtgtcgtt tgaattatta aaacgatttt ctaacaatat gattatcatt 360 
tttaaaaaag tcggtgatca acatcgtgat attgtacaag tgtttgaaac ccataatgac 420 
acatatatgg accaccttag tatttcgatt caaggtagaa aacatgtttg gattcaaaat 480 
gtttgttacc aaacacgtca agatgccgat acgatttctg cattagcagc tattagagac 540 
aatacaaaat tagacttaat tcatgatcaa gaagattttg gtgcacattt tttaactgaa 600 
aaggaaatta atcaattaga tattaaccaa gaatatttaa cgcaggttga tgttatagct 660 
caaaagtgtg atgcagaatt aaaatatcat caatctctac ttcctcaata tgagacacct 720 
aatgatgaat cagctaaaaa atatttgtgg cgtgtcttag ttacacaatt gaaaaaatta 780 
gaacttaatt atgacgtcta tttagagcga ttgaaatatg agtataaagt tattactaat 840 
atgggttttg aagattattt cttaatagta agtgatttaa tccattatgc gaaaacgaat 900 
gatgtgatgg taggtcctgg tcgtggttct tcagctggct cactggtcag ttatttattg 960 
ggaattacaa cgattgatcc tattaaattc aatctattat ttgaacgttt tttaaaccca 1020 
gaacgtgtaa caatgcctga tattgatatt gactttgaag atacacgccg agaaagggtc 1080 
attcagtacg tccaagaaaa atatggcgag ctacatgtat ctggaattgt gactttcggt 1140 
catctgcttg caagagcagt tgctagagat gttggaagaa ttatggggtt tgatgaagtt 1200 
acattaaatg aaatttcaag tttaatccca cataaattag gaattacact tgatgaagca 1260 
tatcaaattg acgattttaa agagtttgta catcgaaacc atcgacatga acgctggttc 1320 
agtatttgta aaaagttaga aggtttacca agacatacat ctacacatgc ggcaggaatt 1380 
attattaatg accatccatt atatgaatat gcccctttaa cgaaagggga tacaggatta 1440 
ttaacgcaat ggacaatgac tgaagccgaa cgtattgggt tattaaaaat agattttcta 1500 
gggttgagaa acttatcgat tattcatcaa atcttaacac aagtcaaaaa agatttaggt 1560 
attaatattg 
caaggagata 
aaattaaagc 
ccaatggaag 
ttacatccgc 
caaattatgc 
agaagagcaa 
gaaggtgcaa 
ctgaaatttg 
tacattatga 
aatgttattg 
atcactatat 
ggcatttatt 
gttgatgaac 
ccgaagagag 
gcttttggta 
ttaaacattg 
gataaagaag 
tatgtttcgc 
aaattgagta 
attcgaacta 
ttagatggtg 
gacttgttta 
aatgagattc 
ataattagaa 
aatgctaatg 
ggctatatta 
gatattaggc 



atatcgaaaa 
cgactggcat 
cggaacactt 
aaattccaac 
atttagaacc 
aaatagcgag 
tgagtaaaaa 
agcaaaatgg 
ctgattatgg 
gctttttaaa 
gaagtgagaa 
tgccaccgaa 
tatcaattgg 
gttatcagaa 
tcaaaacgag 
aaacacgttc 
aacaagatgg 
aattgcctga 
aacacccagt 
acgcgcagaa 
aaaatggtca 
tgattttccc 
tagttagcgg 
agacattagc 
ataaatcaca 
atgttgtgtt 
atcaaaaaga 
ttata 



gattccgttt 
attccaatta 
tgaagatatt 
ttacattaca 
Catattaaaa 
cacatttgca 
aaatagagct 
ttatcacgaa 
ttttcctaga 
agtccattat 
gaaaactgct 
cattaacgaa 
tacaattaaa 
cggcaaattt 
aaagttactt 
aacgttgttg 
ttttttattt 
tgcacttatt 
agataaaaag 
ttataaacct 
aaatatggca 
taatcagttt 
gaaatttgac 
cacttttgaa 
aatagatatg 
atccttttat 
tagtatgttt 



gatgatcaaa 
gagtctgacg 
gttgctgtaa 
agaagacatg 
aatacttacg 
aacttcagtt 
gttcttgaaa 
gacattagta 
gcacatgctg 
ccaaattatt 
caaatgatag 
agtcattggt 
ggtgttggtt 
aaagatttct 
gaagcactga 
caagctattg 
gatattttaa 
agtcagtacg 
tttgttgcca 
atattagtac 
ttcgtcacat 
aaaaagtacg 
catagaaagc 
gaacaaaaat 
tttgaagaga 
gatgaaacga 
aataatttta 



aagtgtttga 
gtgtaagaag 
cttctttgta 
atccaagcaa 
gtgttattat 
atggtgaagc 
gtgagcgtca 
agcaaatatt 
tcagctattc 
tttacgcaaa 
aagaagcaaa 
tttataaacc 
atcaaagtgt 
ttgattttgc 
ttttagtggg 
atcaagtgtt 
cgccaaaaca 
aaaaagaata 
aacaatattt 
agtttgataa 
taaatgatgg 
aagagttgtt 
aacaacgtca 
tagcatttgc 
tgattaaagc 
ttaaacaaat 
tacaatcctt 



attgttgtcg 


1620 


tgtattaaaa 


1680 


tagaccaggt 


1740 


agttcaatat 


1800 


ttatcaagag 


1860 


ggatatttta 


1920 


acattttata 


1980 


tgat ttgatt 


2040 


taaaat tgca 


2100 


tattttaaat 


2160 




2220 




2280 


qaaaatqatt 


2340 


tagacgtata 


2400 


agcg t t tga t 


2460 




2520 


gatgtatgaa 


2580 


tttaggattt 


2640 


aacgatattt 


2700 


agttaaacaa 


2760 


cattgaaact 


2820 


atcacataat 


2880 


actaattata 


2940 


caaacaaatt 


3000 


tacgaaagag 


3060 


gactacttta 


3120 


taaccctagt 


3180 




3195 



The S. aureus dnaE encoded protein, called α-small, has an amino acid 
sequence corresponding to SEQ. ID. No. 2 as follows: 
Met Val Ala Tyr Leu Asn Ile His Thr Ala Tyr Asp Leu Leu Asn Ser 
1 5 10 15 

Ser Leu Lys Ile Glu Asp Ala Val Arg Leu Ala Val Ser Glu Asn Val 
20 25 30 

Asp Ala Leu Ala Ile Thr Asp Thr Asn Val Leu Tyr Gly Phe Pro Lys 
35 40 45 

Phe Tyr Asp Ala Cys Ile Ala Asn Asn Ile Lys Pro Ile Phe Gly Met 
50 55 60 

Thr Ile Tyr Val Thr Asn Gly Leu Asn Thr Val Glu Thr Val Val Leu 
65 70 75 80 

Ala Lys Asn Asn Asp Gly Leu Lys Asp Leu Tyr Gln Leu Ser Ser Glu 
85 90 95 

Ile Lys Met Asn Ala Leu Glu His Val Ser Phe Glu Leu Leu Lys Arg 
100 105 110 

Phe Ser Asn Asn Met Ile Ile Ile Phe Lys Lys Val Gly Asp Gln His 
115 120 125 

Arg Asp Ile Val Gln Val Phe Glu Thr His Asn Asp Thr Tyr Met Asp 
130 135 140 

His Leu Ser Ile Ser Ile Gln Gly Arg Lys His Val Trp Ile Gln Asn 
145 150 155 160 






Val Cys Tyr Gin Thr Arg Gin Asp Ala Asp Thr lie Ser Ala Leu Ala 
165 170 175 

Ala lie Arg Asp Asn Thr Lys Leu Asp Leu lie His Asp Gin Glu Asp 
180 185 190 

Phe Gly Ala His Phe Leu Thr Glu Lys Glu lie Asn Gin Leu Asp lie 
195 200 205 

Asn Gin Glu Tyr Leu Thr Gin Val Asp Val lie Ala Gin Lys Cys Asp 
210 215 220 

Ala Glu Leu Lys Tyr His Gin Ser Leu Leu Pro Gin Tyr Glu Thr Pro 
225 230 235 240 

Asn Asp Glu Ser Ala Lys Lys Tyr Leu Trp Arg Val Leu Val Thr Gin 
245 250 255 

Leu Lys Lys Leu Glu Leu Asn Tyr Asp Val Tyr Leu Glu Arg Leu Lys 
260 265 270 

Tyr Glu Tyr Lys Val lie Thr Asn Met Gly Phe Glu Asp Tyr Phe Leu 
275 280 285 

lie Val Ser Asp Leu lie His Tyr Ala Lys Thr Asn Asp Val Met Val 
290 295 300 

Gly Pro Gly Arg Gly Ser Ser Ala Gly Ser Leu Val Ser Tyr Leu Leu 
305 310 315 320 

Gly lie Thr Thr lie Asp Pro lie Lys Phe Asn Leu Leu Phe Glu Arg 
325 330 335 

Phe Leu Asn Pro Glu Arg Val Thr Met Pro Asp lie Asp lie Asp Phe 
340 345 350 

Glu Asp Thr Arg Arg Glu Arg Val lie Gin Tyr Val Gin Glu Lys Tyr 
355 360 365 

Gly Glu Leu His Val Ser Gly lie Val Thr Phe Gly His Leu Leu Ala 
370 375 380 

Arg Ala Val Ala Arg Asp Val Gly Arg He Met Gly Phe Asp Glu Val 
385 390 395 400 

Thr Leu Asn Glu He Ser Ser Leu He Pro His Lys Leu Gly He Thr 
405 410 415 

Leu Asp Glu Ala Tyr Gin He Asp Asp Phe Lys Glu Phe Val His Arg 
420 425 430 

Asn His Arg His Glu Arg Trp Phe Ser He Cys Lys Lys Leu Glu Gly 
435 440 445 

Leu Pro Arg His Thr Ser Thr His Ala Ala Gly He He He Asn Asp 
450 455 460 

His Pro Leu Tyr Glu Tyr Ala Pro Leu Thr Lys Gly Asp Thr Gly Leu 
465 470 475 480 

Leu Thr Gin Trp Thr Met Thr Glu Ala Glu Arg He Gly Leu Leu Lys 
485 490 495 
Ile Asp Phe Leu Gly Leu Arg Asn Leu Ser Ile Ile His Gln Ile Leu 
500 505 510 

Thr Gin Val Lys Lys Asp Leu Gly lie Asn lie Asp lie Glu Lys lie 
515 520 v 525 

Pro Phe Asp Asp Gin Lys Val Phe Glu Leu Leu Ser Gin Gly Asp Thr 
530 535 540 

Thr Gly He Phe Gin Leu Glu Ser Asp Gly Val Arg Ser Val Leu Lys 
545 550 555 560 

Lys Leu Lys Pro Glu His Phe Glu Asp He Val Ala Val Thr Ser Leu 
565 570 575 

Tyr Arg Pro Gly Pro Met Glu Glu lie Pro Thr Tyr lie Thr Arg Arg 
580 585 590 

His Asp Pro Ser Lys Val Gin Tyr Leu His Pro His Leu Glu Pro lie 
595 600 605 

Leu Lys Asn Thr Tyr Gly Val He He Tyr Gin Glu Gin He Met Gin 
610 615 620 

lie Ala Ser Thr Phe Ala Asn Phe Ser Tyr Gly Glu Ala Asp He Leu 
625 630 635 640 

Arg Arg Ala Met Ser Lys Lys Asn Arg Ala Val Leu Glu Ser Glu Arg 
645 650 655 

Gin His Phe He Glu Gly Ala Lys Gin Asn Gly Tyr His Glu Asp He 
660 665 670 

Ser Lys Gin lie Phe Asp Leu He Leu Lys Phe Ala Asp Tyr Gly Phe 
675 680 685 

Pro Arg Ala His Ala Val Ser Tyr Ser Lys He Ala Tyr He Met Ser 
690 695 700 

Phe Leu Lys Val His Tyr Pro Asn Tyr Phe Tyr Ala Asn He Leu Ser 
705 710 715 720 

Asn Val He Gly Ser Glu Lys Lys Thr Ala Gin Met He Glu Glu Ala 
725 730 735 

Lys Lys Gin Gly He Thr He Leu Pro Pro Asn He Asn Glu Ser His 
740 745 750 

Trp Phe Tyr Lys Pro Ser Gin Glu Gly He Tyr Leu Ser He Gly Thr 
755 760 765 

He Lys Gly Val Gly Tyr Gin Ser Val Lys Val He Val Asp Glu Arg 
770 775 780 

Tyr Gin Asn Gly Lys Phe Lys Asp Phe Phe Asp Phe Ala Arg Arg He 
785 790 795 800 

Pro Lys Arg Val Lys Thr Arg Lys Leu Leu Glu Ala Leu He Leu Val 
805 810 815 

Gly Ala Phe Asp Ala Phe Gly Lys Thr Arg Ser Thr Leu Leu Gin Ala 
820 825 830 
Ile Asp Gln Val Leu Asp Gly Asp Leu Asn Ile Glu Gln Asp Gly Phe 
835 840 845 

Leu Phe Asp lie Leu Thr Pro Lys Gin Met Tyr Glu Asp Lys Glu Glu 
5 850 855 860 

Leu Pro Asp Ala Leu lie Ser Gin Tyr Glu Lys Glu Tyr Leu Gly Phe 
865 870 875 880 

10 Tyr Val Ser Gin His Pro Val Asp Lys Lys Phe Val Ala Lys Gin Tyr 

885 890 895 






Leu Thr lie Phe Lys Leu Ser Asn Ala Gin Asn Tyr Lys Pro lie Leu 
900 905 910 

Val Gin Phe Asp Lys Val Lys Gin lie Arg Thr Lys Asn Gly Gin Asn 
915 920 925 



Met Ala Phe Val Thr Leu Asn Asp Gly lie Glu Thr Leu Asp Gly Val 
20 930 935 940 

lie Phe Pro Asn Gin Phe Lys Lys Tyr Glu Glu Leu Leu Ser His Asn 
945 950 955 960 

25 Asp Leu Phe lie Val Ser Gly Lys Phe Asp His Arg Lys Gin Gin Arg 

965 970 975 



Gin Leu lie lie Asn Glu lie Gin Thr Leu Ala Thr Phe Glu Glu Gin 

980 985 990 

Lys Leu Ala Phe Ala Lys Gin lie lie lie Arg Asn Lys Ser Gin lie 

995 1000 1005 



Asp Met Phe Glu Glu Met lie Lys Ala Thr Lys Glu Asn Ala Asn Asp 

35 1010 1015 1020 

Val Val Leu Ser Phe Tyr Asp Glu Thr lie Lys Gin Met Thr Thr Leu 

1025 1030 1035 1040 

40 Gly Tyr lie Asn Gin Lys Asp Ser Met Phe Asn Asn Phe lie Gin Ser 

1045 1050 1055 



Phe Asn Pro Ser Asp lie Arg Leu lie 
1060 1065 
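
The correspondence between SEQ. ID. No. 1 and SEQ. ID. No. 2 can be spot-checked by translating the nucleotide sequence with the standard genetic code. The following is a minimal sketch under stated assumptions: the helper translate and the compact codon table are introduced here for illustration, one-letter amino acid codes are used instead of the three-letter codes of the listing, and only the first 60 nucleotides of SEQ. ID. No. 1 are shown.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order.
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; '*' marks a stop codon, 'X' an unreadable codon."""
    dna = "".join(dna.lower().split())  # drop the spaces between 10-nucleotide groups
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, len(dna) - 2, 3)
    )


# Nucleotides 1-60 of SEQ. ID. No. 1 as listed above.
first_row = "atggtggcat atttaaatat tcatacggct tatgatttgt taaattcaag cttaaaaata"
print(translate(first_row))
```

The result, MVAYLNIHTAYDLLNSSLKI, corresponds to residues 1-20 of SEQ. ID. No. 2 (Met Val Ala Tyr Leu Asn Ile His Thr Ala Tyr Asp Leu Leu Asn Ser Ser Leu Lys Ile).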

The present invention also relates to the S. aureus dnaX gene. This 
S. aureus dnaX gene has a partial nucleotide sequence corresponding to SEQ. ID. 
No. 3 as follows: 



ttgaattatc aagccttata tcgtatgtac agaccccaaa gtttcgagga tgtcgtcgga 60 
caagaacatg tcacgaagac attgcgcaat gcgatttcga aagaaaaaca gtcgcatgca 120 
tatattttta gtggtccgag aggtacgggg aaaacgagta ttgccaaagt gtttgctaaa 180 
gcaatcaact gtttaaatag cactgatgga gaaccttgta atgaatgtca tatttgtaaa 240 
ggcattacgc aggggactaa ttcagatgtg atagaaattg atgctgctag taataatggc 300 
gttgatgaaa taagaaatat tagagacaaa gttaaatatg caccaagtga atcgaaatat 360 
aaagtttata ttatagatga ggtgcacatg ctaacaacag gtgcttttaa tgccctttta 420 
aagacgttag aagaacctcc agcacacgct atttttatat tggcaacgac agaaccacat 480 
aaaatccctc caacaatcat ttctagggca caacgttttg attttaaagc aattagccta 540 
gatcaaattg ttgaacgttt aaaatttgta gcagatgcac aacaaattga atgtgaagat 600 
gaagccttgg catttatcgc taaagcgtct gaagggggta tgcgtgatgc attaagtatt 660 
atggatcagg ctattgcttt cggcgatggc acattgacat tacaagatgc cctaaatgtt 720 

acgggtagcg ttcatgatga agcgttggat cacttgtttg atgatattgt acaaggtgac 780 

gtacaagcat cttttaaaaa ataccatcag tttataacag aaggtaaaga agtgaatcgc 840 

ctaataaatg atatgattta ttttgtcaga gatacgatta tgaataaaac atctgagaaa 900 

gatactgagt atcgagcact gatgaactta gaattagata tgttatatca aatgattgat 960 

cttattaatg atacattagt gtcgattcgt tttagtgtga atcaaaacgt tcattttgaa 1020 

gtattgttag taaaattagc tgagcagatt aagggtcaac cacaagtgat tgcgaatgta 1080 

gctgaaccag cacaaattgc ttcatcgcca aacacagatg tattgttgca acgtatggaa 1140 

cagttagagc aagaactaaa aacactaaaa gcacaaggag tgagtgttgc tcctactcaa 1200 

aaatcttcga aaaagcctgc gagaggtata caaaaatcta aaaatgcatt ttcaatgcaa 1260 

caaattgcaa aagtgctaga taaagcgaat aaggcagata tcaaattgtt gaaagatcat 1320 

tggcaagaag tgattgacca tgcccaaaac aatgataaaa aatcactcgt tagtttattg 1380 

caaaattcgg aacctgtggc ggcaagtgaa gatcacgtcc ttgtgaaatt tgaggaagag 1440 

atccattgtg aaatcgtcaa taaagacgac gagaaacgta gtagtataga aagtgttgta 1500 

tgtaatatcg ttaataaaaa cgttaaagtt gttggtgtac catcagatca atggcaaaga 1560 

gttcgaacgg agtatttaca aaatcgtaaa aacgaaggcg atgatatgcc aaagcaacaa 1620 

gcacaacaaa cagatattgc tcaaaaagca aaagatcttt tcggtgaaga aactgtacat 1680 

gtgatagatg aagagtga 1698 






The S. aureus dnaX encoded protein (i.e., the tau subunit) has a partial 
amino acid sequence corresponding to SEQ. ID. No. 4 as follows: 



Leu Asn Tyr Gin Ala Leu Tyr Arg Met Tyr Arg Pro Gin Ser Phe Glu 
25 15 10 15 

Asp Val Val Gly Gin Glu His Val Thr Lys Thr Leu Arg Asn Ala lie 
20 25 30 

30 Ser Lys Glu Lys Gin Ser His Ala Tyr lie Phe Ser Gly Pro Arg Gly 

35 40 45 






Thr Gly Lys Thr Ser lie Ala Lys Val Phe Ala Lys Ala lie Asn Cys 

50 55 60 

Leu Asn Ser Thr Asp Gly Glu Pro Cys Asn Glu Cys His lie Cys Lys 
65 70 75 80 



Gly lie Thr Gin Gly Thr Asn Ser Asp Val lie Glu lie Asp Ala Ala 

40 85 90 95 

Ser Asn Asn Gly Val Asp Glu lie Arg Asn lie Arg Asp Lys Val Lys 
100 105 110 

45 Tyr Ala Pro Ser Glu Ser Lys Tyr Lys Val Tyr lie lie Asp Glu Val 

115 ^ 120 125 



His Met Leu Thr Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu 
130 135 140 

Glu Pro Pro Ala His Ala lie Phe lie Leu Ala Thr Thr Glu Pro His 

145 150 155 160 



Lys lie Pro Pro Thr lie lie Ser Arg Ala Gin Arg Phe Asp Phe Lys 
55 165 170 175 

Ala lie Ser Leu Asp Gin lie Val Glu Arg Leu Lys Phe Val Ala Asp 
180 185 190 

60 Ala Gin Gin lie Glu Cys Glu Asp Glu Ala Leu Ala Phe lie Ala Lys 

195 200 205 
Ala Ser Glu Gly Gly Met Arg Asp Ala Leu Ser Ile Met Asp Gln Ala 
210 215 220 

lie Ala Phe Gly Asp Gly Thr Leu Thr Leu Gin Asp Ala Leu Asn Val 
225 230 235 240 

Thr Gly Ser Val His Asp Glu Ala Leu Asp His Leu Phe Asp Asp lie 
245 250 255 

Val Gin Gly Asp Val Gin Ala Ser Phe Lys Lys Tyr His Gin Phe lie 
260 265 270 

Thr Glu Gly Lys Glu Val Asn Arg Leu lie Asn Asp Met lie Tyr Phe 
275 280 285 

Val Arg Asp Thr lie Met Asn Lys Thr Ser Glu Lys Asp Thr Glu Tyr 
290 295 300 

Arg Ala Leu Met Asn Leu Glu Leu Asp Met Leu Tyr Gin Met lie Asp 
305 310 315 320 

Leu lie Asn Asp Thr Leu Val Ser lie Arg Phe Ser Val Asn Gin Asn 
325 330 335 

Val His Phe Glu Val Leu Leu Val Lys Leu Ala Glu Gin He Lys Gly 
340 345 350 

Gin Pro Gin Val He Ala Asn Val Ala Glu Pro Ala Gin lie Ala Ser 
355 360 365 

Ser Pro Asn Thr Asp Val Leu Leu Gin Arg Met Glu Gin Leu Glu Gin 
370 375 380 

Glu Leu Lys Thr Leu Lys Ala Gin Gly Val Ser Val Ala Pro Thr Gin 
385 390 395 400 

Lys Ser Ser Lys Lys Pro Ala Arg Gly He Gin Lys Ser Lys Asn Ala 
405 410 415 

Phe Ser Met Gin Gin lie Ala Lys Val Leu Asp Lys Ala Asn Lys Ala 
420 425 430 

Asp He Lys Leu Leu Lys Asp His Trp Gin Glu Val He Asp His Ala 
435 440 445 

Gin Asn Asn Asp Lys Lys Ser Leu Val Ser Leu Leu Gin Asn Ser Glu 
450 455 460 

Pro Val Ala Ala Ser Glu Asp His Val Leu Val Lys Phe Glu Glu Glu 
465 470 475 480 

He His Cys Glu He Val Asn Lys Asp Asp Glu Lys Arg Ser Ser He 
485 490 495 

Glu Ser Val Val Cys Asn He Val Asn Lys Asn Val Lys Val Val Gly 
500 505 510 

Val Pro Ser Asp Gin Trp Gin Arg Val Arg Thr Glu Tyr Leu Gin Asn 
515 520 525 

Arg Lys Asn Glu Gly Asp Asp Met Pro Lys Gin Gin Ala Gin Gin Thr 
530 535 540 
Asp Ile Ala Gln Lys Ala Lys Asp Leu Phe Gly Glu Glu Thr Val His 

545 550 555 560 

Val lie Asp Glu Glu Glx 
5 565 

The tau subunit of S. aureus functions as do both the tau subunit and the gamma 
subunit of E. coli. 

This invention also relates to the partial nucleotide sequence of the 
S. aureus dnaB gene. The partial nucleotide sequence of this dnaB gene corresponds 
to SEQ. ID. No. 5 as follows: 



atggatagaa tgtatgagca aaatcaaatg ccgcataaca atgaagctga acagtctgtc 60 
ttaggttcaa ttattataga tccagaattg attaatacta ctcaggaagt tttgcttcct 120 
gagtcgtttt ataggggtgc ccatcaacat attttccgtg caatgatgca cttaaatgaa 180 
gataataaag aaattgatgt tgtaacattg atggatcaat tatcgacgga aggtacgttg 240 
aatgaagcgg gtggcccgca atatcttgca gagttatcta caaatgtacc aacgacgcga 300 
aatgttcagt attatactga tatcgtttct aagcatgcat taaaacgtag attgattcaa 360 
actgcagata gtattgccaa tgatggatat aatgatgaac ttgaactaga tgcgatttta 420 
agtgatgcag aacgtcgaat tttagagcta tcatcttctc gtgaaagcga tggctttaaa 480 
gacattcgag acgtcttagg acaagtgtat gaaacagctg aagagcttga tcaaaatagt 540 
ggtcaaacac caggtatacc tacaggatat cgagatttag accaaatgac agcagggttc 600 
aaccgaaatg atttaattat ccttgcagcg cgtccatctg taggtaagac tgcgttcgca 660 
cttaatattg cacaaaaagt tgcaacgcat gaagatatgt atacagttgg tattttctcg 720 
ctagagatgg gtgctgatca gttagccaca cgtatgattt gtagttctgg aaatgttgac 780 
tcaaaccgct taagaacggg tactatgact gaggaagatt ggagtcgttt tactatagcg 840 
gtaggtaaat tatcacgtac gaagattttt attgatgata caccgggtat tcgaattaat 900 
gatttacgtt ctaaatgtcg tcgattaaag caagaacatg gcttagacat gattgtgatt 960 
gactacttac agttgattca aggtagtggt tcacgtgcgt ccgataacag acaacaggaa 1020 
gtttctgaaa tctctcgtac attaaaagca ttagcccgtg aattaaaatg tccagttatc 1080 
gcattaagtc agttatctcg tggtgttgaa caacgacaag ataaacgtcc aatgatgagt 1140 
gatattcgtg aatctggttc gattgagcaa gatgccgata tcgttgcatt cttataccgt 1200 
gatgattact ataaccgtgg cggcgatgaa gatgatgacg atgatggtgg tttcgagcca 1260 
caaacgaatg atgaaaacgg tgaaattgaa attatcattg ctaagcaacg taacggtcca 1320 
acaggcacag ttaagttaca ttttatgaaa caatataata aatttaccga tatcgattat 1380 
gcacatgcag atatgatg 1398 






The amino acid sequence of S. aureus DnaB encoded by the dnaB gene 
corresponds to SEQ. ID. No. 6 as follows: 



Met Asp Arg Met Tyr Glu Gin Asn Gin Met Pro His Asn Asn Glu Ala 
15 10 15 

Glu Gin Ser Val Leu Gly Ser lie lie lie Asp Pro Glu Leu lie Asn 
20 25 30 






Thr Thr Gin Glu Val Leu Leu Pro Glu Ser Phe Tyr Arg Gly Ala His 

35 40 ' 45 

Gin His lie Phe Arg Ala Met Met His Leu Asn Glu Asp Asn Lys Glu 

50 55 60 

lie Asp Val Val Thr Leu Met Asp Gin Leu Ser Thr Glu Gly Thr Leu 

65 70 75 80 
Asn Glu Ala Gly Gly Pro Gln Tyr Leu Ala Glu Leu Ser Thr Asn Val 
8 5 90 95 

Pro Thr Thr Arg Asn Val Gin Tyr Tyr Thr Asp He Val Ser Lys His 
100 105 110 

Ala Leu Lys Arg Arg Leu He Gin Thr Ala Asp Ser He Ala Asn Asp 
115 120 125 

Gly Tyr Asn Asp Glu Leu Glu Leu Asp Ala He Leu Ser Asp Ala Glu 
130 135 140 

Arg Arg He Leu Glu Leu Ser Ser Ser Arg Glu Ser Asp Gly Phe Lys 
145 150 155 160 

Asp He Arg Asp Val Leu Gly Gin Val Tyr Glu Thr Ala Glu Glu Leu 
165 170 175 

Asp Gin Asn Ser Gly Gin Thr Pro Gly He Pro Thr Gly Tyr Arg Asp 
180 185 190 

Leu Asp Gin Met Thr Ala Gly Phe Asn Arg Asn Asp Leu He He Leu 
195 200 205 

Ala Ala Arg Pro Ser Val Gly Lys Thr Ala Phe Ala Leu Asn He Ala 
210 215 220 

Gin Lys Val Ala Thr His Glu Asp Met Tyr Thr Val Gly He Phe Ser 
225 230 ~ 235 240 

Leu Glu Met Gly Ala Asp Gin Leu Ala Thr Arg Met He Cys Ser Ser 
245 250 ^ 255 

Gly Asn Val Asp Ser Asn Arg Leu Arg Thr Gly Thr Met Thr Glu Glu 
260 265 270 

Asp Trp Ser Arg Phe Thr He Ala Val Gly Lys Leu Ser Arg Thr Lys 
275 280 ' 285 

He Phe He Asp Asp Thr Pro Gly He Arg He Asn Asp Leu Arg Ser 
2 90 2 95 3 00 

Lys Cys Arg Arg Leu Lys Gin Glu His Gly Leu Asp Met He Val He 
305 310 315 320 

Asp Tyr Leu Gin Leu He Gin Gly Ser Gly Ser Arg Ala Ser Asp Asn 
325 330 335 

Arg Gin Gin Glu Val Ser Glu He Ser Arg Thr Leu Lys Ala Leu Ala 
340 345 350 

Arg Glu Leu Lys Cys Pro Val He Ala Leu Ser Gin Leu Ser Arg Gly 
355 360 365 

Val Glu Gin Arg Gin Asp Lys Arg Pro Met Met Ser Asp He Arg Glu 
370 375 380 

Ser Gly Ser He Glu Gin Asp Ala Asp He Val Ala Phe Leu Tyr Arg 
385 390 395 " 400 

Asp Asp Tyr Tyr Asn Arg Gly Gly Asp Glu Asp Asp Asp Asp Asp Gly 
405 ^ 410 415 
Gly Phe Glu Pro Gln Thr Asn Asp Glu Asn Gly Glu Ile Glu Ile Ile 
420 425 430 

He Ala Lys Gin Arg Asn Gly Pro Thr Gly Thr Val Lys Leu His Phe 
5 435 440 445 

Met Lys Gin Tyr Asn Lys Phe Thr Asp He Asp Tyr Ala His Ala Asp 
450 455 460 

10 Met Met 

465 
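
The nucleotide totals printed at the end of SEQ. ID. Nos. 1, 3 and 5 can be cross-checked against the residue numbering of the corresponding amino acid listings, since each codon is three nucleotides. The sketch below is illustrative only; the function name codon_count and the dictionary labels are assumptions, while the three lengths are the totals given in the listings above.

```python
def codon_count(nucleotides: int) -> int:
    """Number of whole codons in a coding sequence of the given length."""
    if nucleotides % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return nucleotides // 3


# Final position numbers printed in the nucleotide listings above.
listed_lengths = {
    "SEQ. ID. No. 1 (dnaE)": 3195,
    "SEQ. ID. No. 3 (dnaX)": 1698,
    "SEQ. ID. No. 5 (dnaB)": 1398,
}

for name, length in listed_lengths.items():
    print(name, codon_count(length))
```

The counts are 1065, 566 and 466 codons. These agree with the listings: SEQ. ID. No. 2 ends at residue 1065, SEQ. ID. No. 4 numbers 565 residues followed by a final entry opposite the terminal tga codon, and SEQ. ID. No. 6 ends one residue after the position numbered 465.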

The present invention also relates to the S. aureus polC gene (encoding 
Pol III-L or α-large). The partial nucleotide sequence of this polC gene corresponds 
to SEQ. ID. No. 7 as follows: 



atgacagagc aacaaaaatt taaagtgctt gctgatcaaa ttaaaatttc aaatcaatta 60 
gatgctgaaa ttttaaattc aggtgaactg acacgtatag atgtttctaa caaaaacaga 120 
acatgggaat ttcatattac attaccacaa ttcttagctc atgaagatta tttattattt 180 

20 ataaatgcaa tagagcaaga gtttaaagat atcgccaacg ttacatgtcg ttttacggta 24 0 

acaaatggca cgaatcaaga tgaacatgca attaaatact ttgggcactg tattgaccaa 300 
acagctttat ctccaaaagt taaaggtcaa ttgaaacaga aaaagcttat tatgtctgga 3 60 
aaagtattaa aagtaatggt atcaaatgac attgaacgta atcattttga taaggcatgt 420 
aatggaagtc ttatcaaagc gtttagaaat tgtggttttg atatcgataa aatcatattc 480 

25 gaaacaaatg ataatgatca agaacaaaac ttagcttctt tagaagcaca tattcaagaa 54 0 

gaagacgaac aaagtgcacg attggcaaca gagaaacttg aaaaaatgaa agctgaaaaa 600 
gcgaaacaac aagataacaa cgaaagtgct gtcgataagt gtcaaattgg taagccgatt 660 
caaattgaaa atattaaacc aattgaatct attattgagg aagagtttaa agttgcaata 720 
gagggtgtca tttttgatat aaacttaaaa gaacttaaaa gtggtcgcca tatcgtagaa 780 

30 attaaagtga ctgactatac ggactcttta gttttaaaaa tgtttactcg taaaaacaaa 84 0 

gatgatttag aacattttaa agcgctaagt gttggtaaat gggttagggc tcaaggtcgt 900 
attgaagaag atacatttat tagagattta gttatgatga tgtctgatat tgaagagatt 960 
aaaaaagcga caaaaaaaga taaggctgaa gaaaagcgtg tagaattcca cttgcatact 1020 
gcaatgagcc aaatggatgg tatacccaat attggtgcgt atgttaaaca ggcagcagac 1080 

35 tggggacatc cagccattgc ggttacagac cataatgttg tgcaagcatt tccagatgct 114 0 

cacgcagcag cggaaaaaca tggcattaaa atgatatacg gtatggaagg tatgttagtt 1200 
gatgatggtg ttccgattgc atacaaacca caagatgtcg tattaaaaga tgctacttat 1260 
gttgtgttcg acgttgagac aactggttta tcaaatcagt atgataaaat catcgagctt 13 20 
gcagctgtga aagttcataa cggtgaaatc atcgataagt ttgaaaggtt tagtaatccg 1380 

40 catgaacgat tatcggaaac gattatcaat ttgacgcata ttactgatga tatgttagta 1440 

gatgcccctg agattgaaga agtacttaca gagtttaaag aatgggttgg cgatgcgata 1500 
ttcgtagcgc ataatgcttc gtttgatatg ggcttcatcg atacgggata tgaacgtctt 1560 
gggtttggac catcaacgaa tggtgttatc gatactttag aattatctcg tacgattaat 162 0 
actgaatatg gtaaacatgg tttgaatttc ttggctaaaa aatatggcgt agaattaacg 1680 

45 caacatcacc gtgccattta tgatacagaa gcaacagctt acattttcat aaaaatggtt 174 0 

caacaaatga aagaattagg cgtattaaat cataacgaaa tcaacaaaaa actcagtaat 18 00 
gaagatgcat ataaacgtgc aagacctagt catgtcacat taattgtaca aaaccaacaa 1860 
ggtcttaaaa atctatttaa aattgtaagt gcatcattgg tgaagtattt ctaccgtaca 1920 
cctcgaattc cacgttcatt gttagatgaa tatcgtgagg gattattggt aggtacagcg 1980 

50 tgtgatgaag gtgaattatt tacggcagtt atgcagaagg accagagtca agttgaaaaa 204 0 

attgccaaat attatgattt tattgaaatt caaccaccgg cactttatca agatttaatt 2100 
gatagagagc ttattagaga tactgaaaca ttacatgaaa tttatcaacg tttaatacat 2160 
gcaggtgaca cagcgggtat acctgttatt gcgacaggaa atgcacacta tttgtttgaa 2220 
catgatggta tcgcacgtaa aattttaata gcatcacaac ccggcaatcc acttaatcgc 22 80 

55 tcaactttac cggaagcaca ttttagaact acagatgaaa tgttaaacga gtttcatttt 234 0 

ttaggtgaag aaaaagcgca tgaaattgtt gtgaaaaata caaacgaatt agcagatcga 24 0 0 
attgaacgtg ttgttcctat taaagatgaa ttatacacac cgcgtatgga aggtgctaac 2460 
gaagaaatta gagaactaag ttatgcaaat gcgcgtaaac tgtatggtga agacctgcct 2520 
caaatcgtaa ttgatcgatt agaaaaagaa ttaaaaagta ttatcggtaa tggatttgcg 2580 

60 gtaatttact taatttcgca acgtttagtt aaaaaatcat tagatgatgg atacttagtt 2640 

ggttcccgtg gttcagtagg ttctagtttt gtagcgacaa tgactgagat tactgaagta 270 0 

aacccgttac 
ggttcagtag 
cttattaaag 
gttcctgata 
aaagtattat 
aagactgctt 
gctgaaatag 
ccagggggta 
tatcctgccg 
catgataatg 
cttcaagatt 
cagatattta 
ggtacatttg 
aagccaacaa 
tggttaggca 
ggttgtcgtg 
tttaaaataa 
atgaaagaaa 
ttccctaaag 
gtacatcatc 
ttaatcacga 
cgctatatgg 
gaaatggcgc 
gaatttatca 
gaaaacgttg 
gatttaaaca 
tcattaccga 



cgccacacta 
gatcaggatt 
aaggacaaga 
tcgacttaaa 
ttggtgagga 
ttggttatgt 
atcgactcgt 
ttattgtagt 
atgatcaaaa 
tattaaaact 
tatcaggaat 
gtacacctga 
gggtaccaga 
cattttctga 
atgctcaaga 
atgatatcat 
tggagtcagt 
atgaagtgcc 
cccatgcagc 
cactttatta 
tgattaaaga 
atctaggtaa 
atcgaggtta 
ttgaaggcga 
cgaaacgaat 
aaaaagctgg 
atttaccaga 



tatttgtccg 
tgatttacct 
tattccgttt 
ctttagtggt 
taaagtattc 
taaaggttat 
taaaggatgt 
acctgattac 
ttcagcatgg 
tgatatactt 
tgatccaaaa 
aagtttgggt 
attcggtaca 
attagttcaa 
attaattaaa 
ggtttattta 
acgtaaaggt 
agattggtat 
agcatacgtt 
ctatgcatct 
taaaacaagc 
aaaagaaaaa 
tcgaatgcaa 
tacacttatt 
tgttgaagct 
attatctcag 
taaagctcaa 



aactgtaaaa 
gataagacgt 
gaaacatttt 
gaatatcaac 
cgtgcaggta 
ttgaatgatc 
acaggtgtta 
atggatattt 
atgacgacac 
ggacacgatg 
acaatacctg 
gttactgaag 
ggattcgtgc 
atctcaggat 
accggtatat 
atgtatgctg 
aaaggtttaa 
ttagattcat 
ttaatggcag 
tactttacaa 
attcgaaata 
gacgtattaa 
ccgattagtt 
ccgccgttca 
cgtgacgatg 
aaaattattg 
ctttcgatat 



cgagtgaatt 
gtgaaacttg 
taggatttaa 
cgaatgccca 
caattggtac 
aaggtatcca 
aacgtacaac 
atgattttac 
attttgattt 
atccaacaat 
tagatgataa 
atgaaatttt 
gtcaaatgtt 
tatctcatgg 
gtgatttatc 
gtttagaacc 
ctgaagaaat 
gtcttaaaat 
tacgtatcgc 
ttcgtgcgtc 
ctgtaaaaga 
cagtcttgga 
tagaaaagag 
tatcagtgcc 
gcccattttt 
agtatttaga 
ttgatatg 



tttcaatgat 
tggagcgcca 
ggg^g^taaa 

taactacaca 
tgttgctgaa 
caaaagaggt 
tggacagcat 
gccgatacaa 
ccattctatt 
gattcgtatg 
agaagttatg 
atgtaaaaca 
agaagataca 
tacagatgtg 
aagtgtaatt 
atcaatggct 
gattgaaacg 
taagtacatg 
atatttcaaa 
agactttgat 
catgtattct 
aattatgaat 
tcaggcgttc 
tgggcttggc 
atcaaaagaa 
tgagttaggc 



2760 
2820 
2880 
2940 
3O00 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4308 



30 



The amino acid sequence of the S. aureus polC gene product, α-large,
corresponds to SEQ. ID. No. 8 as follows:



Met Thr Glu Gln Gln Lys Phe Lys Val Leu Ala Asp Gln Ile Lys Ile
1 5 10 15

Ser Asn Gln Leu Asp Ala Glu Ile Leu Asn Ser Gly Glu Leu Thr Arg
20 25 30

Ile Asp Val Ser Asn Lys Asn Arg Thr Trp Glu Phe His Ile Thr Leu
35 40 45

Pro Gln Phe Leu Ala His Glu Asp Tyr Leu Leu Phe Ile Asn Ala Ile
50 55 60

Glu Gln Glu Phe Lys Asp Ile Ala Asn Val Thr Cys Arg Phe Thr Val
65 70 75 80

Thr Asn Gly Thr Asn Gln Asp Glu His Ala Ile Lys Tyr Phe Gly His
85 90 95

Cys Ile Asp Gln Thr Ala Leu Ser Pro Lys Val Lys Gly Gln Leu Lys
100 105 110

Gln Lys Lys Leu Ile Met Ser Gly Lys Val Leu Lys Val Met Val Ser
115 120 125

Asn Asp Ile Glu Arg Asn His Phe Asp Lys Ala Cys Asn Gly Ser Leu
130 135 140

Ile Lys Ala Phe Arg Asn Cys Gly Phe Asp Ile Asp Lys Ile Ile Phe
145 150 155 160

Glu Thr Asn Asp Asn Asp Gln Glu Gln Asn Leu Ala Ser Leu Glu Ala
165 170 175

His Ile Gln Glu Glu Asp Glu Gln Ser Ala Arg Leu Ala Thr Glu Lys
180 185 190

Leu Glu Lys Met Lys Ala Glu Lys Ala Lys Gln Gln Asp Asn Lys Gln
195 200 205

Ser Ala Val Asp Lys Cys Gln Ile Gly Lys Pro Ile Gln Ile Glu Asn
210 215 220

Ile Lys Pro Ile Glu Ser Ile Ile Glu Glu Glu Phe Lys Val Ala Ile
225 230 235 240

Glu Gly Val Ile Phe Asp Ile Asn Leu Lys Glu Leu Lys Ser Gly Arg
245 250 255

His Ile Val Glu Ile Lys Val Thr Asp Tyr Thr Asp Ser Leu Val Leu
260 265 270

Lys Met Phe Thr Arg Lys Asn Lys Asp Asp Leu Glu His Phe Lys Ala
275 280 285

Leu Ser Val Gly Lys Trp Val Arg Ala Gln Gly Arg Ile Glu Glu Asp
290 295 300

Thr Phe Ile Arg Asp Leu Val Met Met Met Ser Asp Ile Glu Glu Ile
305 310 315 320

Lys Lys Ala Thr Lys Lys Asp Lys Ala Glu Glu Lys Arg Val Glu Phe
325 330 335

His Leu His Thr Ala Met Ser Gln Met Asp Gly Ile Pro Asn Ile Gly
340 345 350

Ala Tyr Val Lys Gln Ala Ala Asp Trp Gly His Pro Ala Ile Ala Val
355 360 365

Thr Asp His Asn Val Val Gln Ala Phe Pro Asp Ala His Ala Ala Ala
370 375 380

Glu Lys His Gly Ile Lys Met Ile Tyr Gly Met Glu Gly Met Leu Val
385 390 395 400

Asp Asp Gly Val Pro Ile Ala Tyr Lys Pro Gln Asp Val Val Leu Lys
405 410 415

Asp Ala Thr Tyr Val Val Phe Asp Val Glu Thr Thr Gly Leu Ser Asn
420 425 430

Gln Tyr Asp Lys Ile Ile Glu Leu Ala Ala Val Lys Val His Asn Gly
435 440 445

Glu Ile Ile Asp Lys Phe Glu Arg Phe Ser Asn Pro His Glu Arg Leu
450 455 460

Ser Glu Thr Ile Ile Asn Leu Thr His Ile Thr Asp Asp Met Leu Val
465 470 475 480

Asp Ala Pro Glu Ile Glu Glu Val Leu Thr Glu Phe Lys Glu Trp Val
485 490 495

Gly Asp Ala Ile Phe Val Ala His Asn Ala Ser Phe Asp Met Gly Phe
500 505 510

Ile Asp Thr Gly Tyr Glu Arg Leu Gly Phe Gly Pro Ser Thr Asn Gly
515 520 525

Val Ile Asp Thr Leu Glu Leu Ser Arg Thr Ile Asn Thr Glu Tyr Gly
530 535 540

Lys His Gly Leu Asn Phe Leu Ala Lys Lys Tyr Gly Val Glu Leu Thr
545 550 555 560

Gln His His Arg Ala Ile Tyr Asp Thr Glu Ala Thr Ala Tyr Ile Phe
565 570 575

Ile Lys Met Val Gln Gln Met Lys Glu Leu Gly Val Leu Asn His Asn
580 585 590

Glu Ile Asn Lys Lys Leu Ser Asn Glu Asp Ala Tyr Lys Arg Ala Arg
595 600 605

Pro Ser His Val Thr Leu Ile Val Gln Asn Gln Gln Gly Leu Lys Asn
610 615 620

Leu Phe Lys Ile Val Ser Ala Ser Leu Val Lys Tyr Phe Tyr Arg Thr
625 630 635 640

Pro Arg Ile Pro Arg Ser Leu Leu Asp Glu Tyr Arg Glu Gly Leu Leu
645 650 655

Val Gly Thr Ala Cys Asp Glu Gly Glu Leu Phe Thr Ala Val Met Gln
660 665 670

Lys Asp Gln Ser Gln Val Glu Lys Ile Ala Lys Tyr Tyr Asp Phe Ile
675 680 685

Glu Ile Gln Pro Pro Ala Leu Tyr Gln Asp Leu Ile Asp Arg Glu Leu
690 695 700

Ile Arg Asp Thr Glu Thr Leu His Glu Ile Tyr Gln Arg Leu Ile His
705 710 715 720

Ala Gly Asp Thr Ala Gly Ile Pro Val Ile Ala Thr Gly Asn Ala His
725 730 735

Tyr Leu Phe Glu His Asp Gly Ile Ala Arg Lys Ile Leu Ile Ala Ser
740 745 750

Gln Pro Gly Asn Pro Leu Asn Arg Ser Thr Leu Pro Glu Ala His Phe
755 760 765

Arg Thr Thr Asp Glu Met Leu Asn Glu Phe His Phe Leu Gly Glu Glu
770 775 780

Lys Ala His Glu Ile Val Val Lys Asn Thr Asn Glu Leu Ala Asp Arg
785 790 795 800

Ile Glu Arg Val Val Pro Ile Lys Asp Glu Leu Tyr Thr Pro Arg Met
805 810 815

Glu Gly Ala Asn Glu Glu Ile Arg Glu Leu Ser Tyr Ala Asn Ala Arg
820 825 830

Lys Leu Tyr Gly Glu Asp Leu Pro Gln Ile Val Ile Asp Arg Leu Glu
835 840 845

Lys Glu Leu Lys Ser Ile Ile Gly Asn Gly Phe Ala Val Ile Tyr Leu
850 855 860

Ile Ser Gln Arg Leu Val Lys Lys Ser Leu Asp Asp Gly Tyr Leu Val
865 870 875 880

Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr Met Thr Glu
885 890 895

Ile Thr Glu Val Asn Pro Leu Pro Pro His Tyr Ile Cys Pro Asn Cys
900 905 910

Lys Thr Ser Glu Phe Phe Asn Asp Gly Ser Val Gly Ser Gly Phe Asp
915 920 925

Leu Pro Asp Lys Thr Cys Glu Thr Cys Gly Ala Pro Leu Ile Lys Glu
930 935 940

Gly Gln Asp Ile Pro Phe Glu Lys Phe Leu Gly Phe Lys Gly Asp Lys
945 950 955 960

Val Pro Asp Ile Asp Leu Asn Phe Ser Gly Glu Tyr Gln Pro Asn Ala
965 970 975

His Asn Tyr Thr Lys Val Leu Phe Gly Glu Asp Lys Val Phe Arg Ala
980 985 990

Gly Thr Ile Gly Thr Val Ala Glu Lys Thr Ala Phe Gly Tyr Val Lys
995 1000 1005

Gly Tyr Leu Asn Asp Gln Gly Ile His Lys Arg Gly Ala Glu Ile Asp
1010 1015 1020

Arg Leu Val Lys Gly Cys Thr Gly Val Lys Ala Thr Thr Gly Gln His
1025 1030 1035 1040

Pro Gly Gly Ile Ile Val Val Pro Asp Tyr Met Asp Ile Tyr Asp Phe
1045 1050 1055

Thr Pro Ile Gln Tyr Pro Ala Asp Asp Gln Asn Ser Ala Trp Met Thr
1060 1065 1070

Thr His Phe Asp Phe His Ser Ile His Asp Asn Val Leu Lys Leu Asp
1075 1080 1085

Ile Leu Gly His Asp Asp Pro Thr Met Ile Arg Met Leu Gln Asp Leu
1090 1095 1100

Ser Gly Ile Asp Pro Lys Thr Ile Pro Val Asp Asp Lys Glu Val Met
1105 1110 1115 1120

Gln Ile Phe Ser Thr Pro Glu Ser Leu Gly Val Thr Glu Asp Glu Ile
1125 1130 1135

Leu Cys Lys Thr Gly Thr Phe Gly Val Pro Asn Ser Asp Arg Ile Arg
1140 1145 1150

Arg Gln Met Leu Glu Asp Thr Lys Pro Thr Thr Phe Ser Glu Leu Val
1155 1160 1165

Gln Ile Ser Gly Leu Ser His Gly Thr Asp Val Trp Leu Gly Asn Ala
1170 1175 1180

Gln Glu Leu Ile Lys Thr Gly Ile Cys Asp Leu Ser Ser Val Ile Gly
1185 1190 1195 1200

Cys Arg Asp Asp Ile Met Val Tyr Leu Met Tyr Ala Gly Leu Glu Pro
1205 1210 1215

Ser Met Ala Phe Lys Ile Met Glu Ser Val Arg Lys Gly Lys Gly Leu
1220 1225 1230

Thr Glu Glu Met Ile Glu Thr Met Lys Glu Asn Glu Val Pro Asp Trp
1235 1240 1245

Tyr Leu Asp Ser Cys Leu Lys Ile Lys Tyr Ile Phe Pro Lys Ala His
1250 1255 1260

Ala Ala Ala Tyr Val Leu Met Ala Val Arg Ile Ala Tyr Phe Lys Val
1265 1270 1275 1280

His His Pro Leu Tyr Tyr Tyr Ala Ser Tyr Phe Thr Ile Arg Ala Ser
1285 1290 1295

Asp Phe Asp Leu Ile Thr Met Ile Lys Asp Lys Thr Ser Ile Arg Asn
1300 1305 1310

Thr Val Lys Asp Met Tyr Ser Arg Tyr Met Asp Leu Gly Lys Lys Glu
1315 1320 1325

Lys Asp Val Leu Thr Val Leu Glu Ile Met Asn Glu Met Ala His Arg
1330 1335 1340

Gly Tyr Arg Met Gln Pro Ile Ser Leu Glu Lys Ser Gln Ala Phe Glu
1345 1350 1355 1360

Phe Ile Ile Glu Gly Asp Thr Leu Ile Pro Pro Phe Ile Ser Val Pro
1365 1370 1375

Gly Leu Gly Glu Asn Val Ala Lys Arg Ile Val Glu Ala Arg Asp Asp
1380 1385 1390

Gly Pro Phe Leu Ser Lys Glu Asp Leu Asn Lys Lys Ala Gly Leu Tyr
1395 1400 1405

Gln Lys Ile Ile Glu Tyr Leu Asp Glu Leu Gly Ser Leu Pro Asn Leu
1410 1415 1420

Pro Asp Lys Ala Gln Leu Ser Ile Phe Asp Met
1425 1430 1435

This invention also relates to the S. aureus dnaN gene encoding the 
beta subunit. The partial nucleotide sequence of this dnaN gene corresponds to SEQ. 
ID. No. 9 as follows: 



atgatggaat tcactattaa aagagattat tttattacac aattaaatga cacattaaaa 60 

gctatttcac caagaacaac attacctata ttaactggta tcaaaatcga tgcgaaagaa 120 

catgaagtta tattaactgg ttcagactct gaaatttcaa tagaaatcac tattcctaaa 180 

actgtagatg gcgaagatat tgtcaatatt tcagaaacag gctcagtagt acttcctgga 240

cgattctttg ttgatattat aaaaaaatta cctggtaaag atgttaaatt atctacaaat 300

gaacaattcc agacattaat tacatcaggt cattctgaat ttaatttgag tggcttagat 360 

ccagatcaat atcctttatt acctcaagtt tctagagatg acgcaattca attgtcggta 420 

aaagtactta aaaacgtgat tgcacaaacg aattttgcag tgtccacctc agaaacacgc 480 

ccagtactaa ctggtgtgaa ctggcttata caagaaaatg aattaatatg cacagcgact 540 

gattcacacc gcttggctgt aagaaagttg cagttagaag atgtttctga aaacaaaaat 600 

gtcatcattc caggtaaggc tttagctgaa ttaaataaaa ttatgtctga caatgaagaa 660 

gacattgata tcttctttgc ttcaaaccaa gttttattta aagttggaaa tgtgaacttt 720 

atttctcgat tattagaagg acattatcct gatacaacac gtttattccc tgaaaactat 780 

gaaattaaat taagtataga caatggggag ttttatcatg cgattgatcg tgcctcttta 840 

ttagcacgtg aaggtggtaa taacgttatt aaattaagta caggtgatga cgttgttgaa 900 

ttatcttcta catcaccaga aattggtact gtaaaagaag aagttgatgc aaacgatgtt 960 

gaaggtggta gcctgaaaat ttcattcaac tctaaatata tgatggatgc tttaaaagca 1020 

atcgataatg atgaggttga agttgaattc ttcggtacaa tgaaaccatt tattctaaaa 1080 

ccaaaaggtg acgactcggt aacgcaatta attttaccaa tcagaactta ctaa 1134 
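
The row-end numbers in a nucleotide listing such as the one above are running position counts, so each row can be checked against the count printed at its end. The following is a minimal sketch of such a check, not part of the original disclosure; the function name `parse_listing` and the variable `DNAN_ROWS` are hypothetical, and only the first two rows of the dnaN listing are used as sample input.

```python
import re


def parse_listing(rows):
    """Join numbered sequence-listing rows and check each running count.

    Each non-blank row is assumed to hold groups of lowercase bases
    followed by one cumulative position count (an assumption about the
    layout of this listing, not a general file format).
    """
    sequence = ""
    for row in rows:
        tokens = row.split()
        if not tokens:
            continue  # skip blank spacer lines between rows
        *groups, count = tokens
        if not count.isdigit():
            raise ValueError(f"row does not end in a position count: {row!r}")
        for group in groups:
            if not re.fullmatch(r"[acgt]+", group):
                raise ValueError(f"unexpected characters in group: {group!r}")
        sequence += "".join(groups)
        if len(sequence) != int(count):
            raise ValueError(
                f"count {count} does not match running length {len(sequence)}"
            )
    return sequence


# First two rows of the dnaN listing (SEQ. ID. No. 9), used as sample input.
DNAN_ROWS = [
    "atgatggaat tcactattaa aagagattat tttattacac aattaaatga cacattaaaa 60",
    "",
    "gctatttcac caagaacaac attacctata ttaactggta tcaaaatcga tgcgaaagaa 120",
]
dnan_start = parse_listing(DNAN_ROWS)
```

A count that has been split by a stray space (for example `24 0` in place of `240`) is rejected by this check, because the leading fragment is then read as a sequence group.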

The amino acid sequence of the S. aureus beta subunit is as follows (SEQ.
ID. No. 10):



Met Met Glu Phe Thr Ile Lys Arg Asp Tyr Phe Ile Thr Gln Leu Asn
1 5 10 15

Asp Thr Leu Lys Ala Ile Ser Pro Arg Thr Thr Leu Pro Ile Leu Thr
20 25 30

Gly Ile Lys Ile Asp Ala Lys Glu His Glu Val Ile Leu Thr Gly Ser
35 40 45

Asp Ser Glu Ile Ser Ile Glu Ile Thr Ile Pro Lys Thr Val Asp Gly
50 55 60

Glu Asp Ile Val Asn Ile Ser Glu Thr Gly Ser Val Val Leu Pro Gly
65 70 75 80

Arg Phe Phe Val Asp Ile Ile Lys Lys Leu Pro Gly Lys Asp Val Lys
85 90 95

Leu Ser Thr Asn Glu Gln Phe Gln Thr Leu Ile Thr Ser Gly His Ser
100 105 110

Glu Phe Asn Leu Ser Gly Leu Asp Pro Asp Gln Tyr Pro Leu Leu Pro
115 120 125

Gln Val Ser Arg Asp Asp Ala Ile Gln Leu Ser Val Lys Val Leu Lys
130 135 140

Asn Val Ile Ala Gln Thr Asn Phe Ala Val Ser Thr Ser Glu Thr Arg
145 150 155 160

Pro Val Leu Thr Gly Val Asn Trp Leu Ile Gln Glu Asn Glu Leu Ile
165 170 175

Cys Thr Ala Thr Asp Ser His Arg Leu Ala Val Arg Lys Leu Gln Leu
180 185 190

Glu Asp Val Ser Glu Asn Lys Asn Val Ile Ile Pro Gly Lys Ala Leu
195 200 205

Ala Glu Leu Asn Lys Ile Met Ser Asp Asn Glu Glu Asp Ile Asp Ile
210 215 220

Phe Phe Ala Ser Asn Gln Val Leu Phe Lys Val Gly Asn Val Asn Phe
225 230 235 240

Ile Ser Arg Leu Leu Glu Gly His Tyr Pro Asp Thr Thr Arg Leu Phe
245 250 255

Pro Glu Asn Tyr Glu Ile Lys Leu Ser Ile Asp Asn Gly Glu Phe Tyr
260 265 270

His Ala Ile Asp Arg Ala Ser Leu Leu Ala Arg Glu Gly Gly Asn Asn
275 280 285

Val Ile Lys Leu Ser Thr Gly Asp Asp Val Val Glu Leu Ser Ser Thr
290 295 300

Ser Pro Glu Ile Gly Thr Val Lys Glu Glu Val Asp Ala Asn Asp Val
305 310 315 320

Glu Gly Gly Ser Leu Lys Ile Ser Phe Asn Ser Lys Tyr Met Met Asp
325 330 335

Ala Leu Lys Ala Ile Asp Asn Asp Glu Val Glu Val Glu Phe Phe Gly
340 345 350

Thr Met Lys Pro Phe Ile Leu Lys Pro Lys Gly Asp Asp Ser Val Thr
355 360 365

Gln Leu Ile Leu Pro Ile Arg Thr Tyr
370 375
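
The correspondence between a nucleotide listing and its amino acid listing can be checked by translating the nucleotide sequence codon by codon. The sketch below is illustrative only and is not part of the original disclosure; it assumes the standard genetic code, and the names `CODON_TABLE` and `translate` are hypothetical. Output uses one-letter amino acid codes, with `*` marking a stop codon.

```python
from itertools import product

# Standard genetic code, with codons enumerated in t, c, a, g order
# (first base varies slowest, third base fastest).
BASES = "tcag"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate a DNA string in frame 1; whitespace between groups is ignored."""
    dna = "".join(dna.lower().split())
    usable = len(dna) - len(dna) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases of the dnaN listing (SEQ. ID. No. 9).
dnan_peptide = translate("atgatggaat tcactattaa aagagattat")
```

Here `dnan_peptide` is `MMEFTIKRDY`, which agrees with the first ten residues of SEQ. ID. No. 10 (Met Met Glu Phe Thr Ile Lys Arg Asp Tyr).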

This invention also relates to the S. aureus holA gene encoding the
delta subunit. The partial nucleotide sequence of this holA gene corresponds to SEQ.
ID. No. 11 as follows:



atggatgaac agcaacaatt gacgaatgca tatcattcaa ataaattatc gcatgcctat 60
ttatttgaag gtgatgatgc acaaacgatg aaacaagttg cgattaattt tgcaaagctt 120
attttatgtc aaacagatag tcaatgtgaa acaaaggtta gtacatataa tcatccagac 180
tttatgtata tatcaacaac tgagaatgca attaagaaag aacaagttga acaacttgtg 240
cgtcatatga atcaacttcc tatagaaagc acaaataaag tgtacatcat cgaagacttt 300
gaagactttg aaaagttaac tgttcaaggg gaaaacagta tcttgaaatt tcttgaagaa 360
ccaccggaca atacgattgc tattttattg tctacaaaac ctgagcaaat tttagacaca 420
atccattcaa ggtgtcagca tgtatatttc aagcctattg ataaagaaaa gtttataaat 480
agattagttg aacaaaacat gtctaagcca gtagctgaaa tgattagtac ttatactacg 540
caaatagata atgcaatggc tttaaatgaa gaatttgatt tattagcatt aaggaaatca 600
gttatacgtt gggaattgtt gcttactaat aagccaatgg cacttatagg tattattgat 660
ttattgaaac aggctaaaaa taaaaaactg caatctttaa ctattgcagc tgtgaatggt 720
ttcttcgaag atatcataca tacaaaggta aatgtagagg ataaacaaat atatagtgat 780
ttaaaaaatg atattgatca atatgcgcaa aagttgtcgt ttaatcaatt aattttgatg 840
tttgatcaac tgacggaagc acataagaaa ttgaatcaaa atgtaaatcc aacgcttgta 900
tttgaacaaa tcgtaattaa gggtgtgagt 930

The amino acid sequence of the delta subunit encoded by S. aureus 
holA corresponds to SEQ. ID. No. 12 as follows: 

Met Asp Glu Gln Gln Gln Leu Thr Asn Ala Tyr His Ser Asn Lys Leu
1 5 10 15

Ser His Ala Tyr Leu Phe Glu Gly Asp Asp Ala Gln Thr Met Lys Gln
20 25 30

Val Ala Ile Asn Phe Ala Lys Leu Ile Leu Cys Gln Thr Asp Ser Gln
35 40 45

Cys Glu Thr Lys Val Ser Thr Tyr Asn His Pro Asp Phe Met Tyr Ile
50 55 60

Ser Thr Thr Glu Asn Ala Ile Lys Lys Glu Gln Val Glu Gln Leu Val
65 70 75 80

Arg His Met Asn Gln Leu Pro Ile Glu Ser Thr Asn Lys Val Tyr Ile
85 90 95

Ile Glu Asp Phe Glu Asp Phe Glu Lys Leu Thr Val Gln Gly Glu Asn
100 105 110

Ser Ile Leu Lys Phe Leu Glu Glu Pro Pro Asp Asn Thr Ile Ala Ile
115 120 125

Leu Leu Ser Thr Lys Pro Glu Gln Ile Leu Asp Thr Ile His Ser Arg
130 135 140

Cys Gln His Val Tyr Phe Lys Pro Ile Asp Lys Glu Lys Phe Ile Asn
145 150 155 160

Arg Leu Val Glu Gln Asn Met Ser Lys Pro Val Ala Glu Met Ile Ser
165 170 175

Thr Tyr Thr Thr Gln Ile Asp Asn Ala Met Ala Leu Asn Glu Glu Phe
180 185 190

Asp Leu Leu Ala Leu Arg Lys Ser Val Ile Arg Trp Glu Leu Leu Leu
195 200 205

Thr Asn Lys Pro Met Ala Leu Ile Gly Ile Ile Asp Leu Leu Lys Gln
210 215 220

Ala Lys Asn Lys Lys Leu Gln Ser Leu Thr Ile Ala Ala Val Asn Gly
225 230 235 240

Phe Phe Glu Asp Ile Ile His Thr Lys Val Asn Val Glu Asp Lys Gln
245 250 255

Ile Tyr Ser Asp Leu Lys Asn Asp Ile Asp Gln Tyr Ala Gln Lys Leu
260 265 270

Ser Phe Asn Gln Leu Ile Leu Met Phe Asp Gln Leu Thr Glu Ala His
275 280 285

Lys Lys Leu Asn Gln Asn Val Asn Pro Thr Leu Val Phe Glu Gln Ile
290 295 300

Val Ile Lys Gly Val Ser
305 310

This invention also relates to the S. aureus holB gene encoding the 
delta prime subunit. The partial nucleotide sequence of this holB gene corresponds to 
SEQ. ID. No. 13 as follows:

atgagcgaca atattgtagc tatttatgga gatgtgcctg aattggttga aaaacaaagt 60
gcagaaatca tatcacaatt tttgaaaagt gatagagatg actttaactt tgtgaaatat 120
aatttatacg aaacagagat tgcaccaatt gttgaagaaa cattaacatt gcctttcttt 180
tcagataaaa aagcaatttt ggttaaaaat gcatatatat ttacaggtga aaaagcgcca 240
aaagatatgg ctcataatgt agaccaatta atagaattta ttgaaaaata tgatggcgaa 300
aatttgattg tctttgagat atatcaaaat aaacttgatg aaagaaaaaa gttaactaaa 360
actctaaaaa agcatgcaag gcttaaaaaa atagagcaga tgtcggagga gatcaagtgg 420
attcaaaaaa aagaacaagc gattgatttt gtaaaagatc ttataacaat gaaagaagaa 480
ccaattaaac ttcttgcact tacatcaaat tatagacttt tttatcaatg taaaattctt 540
tcacaaaaag gttatagtgg tcaacaaatt gcaaaaacaa taggtgttca tccatataga 600
gtgaaacttg cacttggtca agtgagacat tatcaacttg atgaacttct taatattatt 660
gatgcatgtg cagaaacaga ttataaactt aaatcatcat atatggataa acaacttatt 720
cttgaacttt ttattctttc actt 744



The amino acid sequence of the delta prime subunit encoded by S.
aureus holB corresponds to SEQ. ID. No. 14 as follows:



Met Ser Asp Asn Ile Val Ala Ile Tyr Gly Asp Val Pro Glu Leu Val
1 5 10 15

Glu Lys Gln Ser Ala Glu Ile Ile Ser Gln Phe Leu Lys Ser Asp Arg
20 25 30

Asp Asp Phe Asn Phe Val Lys Tyr Asn Leu Tyr Glu Thr Glu Ile Ala
35 40 45

Pro Ile Val Glu Glu Thr Leu Thr Leu Pro Phe Phe Ser Asp Lys Lys
50 55 60

Ala Ile Leu Val Lys Asn Ala Tyr Ile Phe Thr Gly Glu Lys Ala Pro
65 70 75 80

Lys Asp Met Ala His Asn Val Asp Gln Leu Ile Glu Phe Ile Glu Lys
85 90 95

Tyr Asp Gly Glu Asn Leu Ile Val Phe Glu Ile Tyr Gln Asn Lys Leu
100 105 110

Asp Glu Arg Lys Lys Leu Thr Lys Thr Leu Lys Lys His Ala Arg Leu
115 120 125

Lys Lys Ile Glu Gln Met Ser Glu Glu Ile Lys Trp Ile Gln Lys Lys
130 135 140

Glu Gln Ala Ile Asp Phe Val Lys Asp Leu Ile Thr Met Lys Glu Glu
145 150 155 160

Pro Ile Lys Leu Leu Ala Leu Thr Ser Asn Tyr Arg Leu Phe Tyr Gln
165 170 175

Cys Lys Ile Leu Ser Gln Lys Gly Tyr Ser Gly Gln Gln Ile Ala Lys
180 185 190

Thr Ile Gly Val His Pro Tyr Arg Val Lys Leu Ala Leu Gly Gln Val
195 200 205

Arg His Tyr Gln Leu Asp Glu Leu Leu Asn Ile Ile Asp Ala Cys Ala
210 215 220

Glu Thr Asp Tyr Lys Leu Lys Ser Ser Tyr Met Asp Lys Gln Leu Ile
225 230 235 240

Leu Glu Leu Phe Ile Leu Ser Leu
245



This invention also relates to the S. aureus dnaG gene encoding a 
primase. The partial nucleotide sequence of this dnaG gene corresponds to SEQ. ID. 
No. 15 as follows: 



atgataggtt tgtgtccttt tcatgatgaa aagacacctt catttacagt ttctgaagat 60
aaacaaatct gtcattgttt tggttgtaaa aaaggtggca atgtttttca atttactcaa 120
gaaattaaag acatatcatt tgttgaagcg gttaaagaat taggtgatag agttaatgtt 180
gctgtagata ttgaggcaac acaatctaac tcaaatgttc aaattgcttc tgatgattta 240
caaatgattg aaatgcatga gttaatacaa gaattttatt attacgcttt aacaaagaca 300
gtcgaaggcg aacaagcatt aacatactta caagaacgtg gttttacaga tgcgcttatt 360
aaagagcgag gcattggctt tgcacccgat agctcacatt tttgtcatga ttttcttcaa 420
aaaaagggtt acgatattga attagcatat gaagccggat tattatcacg taacgaagaa 480
aatttcagtt attacgatag atttcgaaat cgtattatgt ttcctttgaa aaatgcgcaa 540
ggaagaattg ttggatattc aggtcgaaca tataccggtc aagaaccaaa atacctaaat 600
agtcctgaaa cgcctatctt tcaaaaaaga aagttgttat ataacttaga taaagcacgt 660
aaatcaatta gaaaattaga tgaaattgta ttactagaag gttttatgga tgttataaaa 720
tctgatactg ctggcttgaa aaacgttgtt gcaacaatgg gcacacagtt gtcagatgaa 780
catattacct ttatacgaaa gttaacatca aatataacat taatgtttga tggggatttt 840
gcgggtagtg aagcaacact taaaacaggt caacatttgt tacagcaagg gctaaatgta 900
tttgttatac aattgccatc tggcatggat ccggatgaat acattggtaa gtatggcaac 960
gacgcattta ctacttttgt aaaaaatgac aaaaagtcat ttgcacatta taaagtaagt 1020
atattaaaag atgaaattgc acataatgac ctttcatatg aacgttattt gaaagaactg 1080
agtcatgaca tttcacttat gaagtcatca attctgcaac aaaaggctat aaatgatgtt 1140
gcgccatttt tcaatgttag tcctgagcag ttagctaacg aaatacaatt caatcaagca 1200
ccagccaatt attatccaga agatgagtat ggcggttatg atgagtatgg cggttatatt 1260
gaacctgagc caattggtat ggcacaattt gacaatttga gccgtcgaga aaaagcggag 1320
cgagcatttt taaaacattt aatgagagat aaagatacat ttttaaatta ttatgaaagt 1380
gttgataagg ataacttcac aaatcagcat tttaaatatg tattcgaagt cttacatgat 1440
ttttatgcgg aaaatgatca atataatatc agtgatgctg tgcagtatgt taattcaaat 1500
gagttgagag aaacactaat tagcttagaa caatataatt tgaatggcga accatatgaa 1560
aatgaaattg atgattatgt caatgttatt aatgaaaaag gacaagaaac aattgagtca 1620
ttgaatcata aattaaggga agctacaagg attggcgatg tagaattaca aaaatactat 1680
ttacagcaaa ttgttgctaa gaataaagaa cgcatgtag 1719



The amino acid sequence of primase encoded by S. aureus dnaG 
corresponds to SEQ. ID. No. 16 as follows: 



Met Ile Gly Leu Cys Pro Phe His Asp Glu Lys Thr Pro Ser Phe Thr
1 5 10 15

Val Ser Glu Asp Lys Gln Ile Cys His Cys Phe Gly Cys Lys Lys Gly
20 25 30

Gly Asn Val Phe Gln Phe Thr Gln Glu Ile Lys Asp Ile Ser Phe Val
35 40 45

Glu Ala Val Lys Glu Leu Gly Asp Arg Val Asn Val Ala Val Asp Ile
50 55 60






Glu Ala Thr Gln Ser Asn Ser Asn Val Gln Ile Ala Ser Asp Asp Leu
65 70 75 80

Gln Met Ile Glu Met His Glu Leu Ile Gln Glu Phe Tyr Tyr Tyr Ala
85 90 95

Leu Thr Lys Thr Val Glu Gly Glu Gln Ala Leu Thr Tyr Leu Gln Glu
100 105 110

Arg Gly Phe Thr Asp Ala Leu Ile Lys Glu Arg Gly Ile Gly Phe Ala
115 120 125

Pro Asp Ser Ser His Phe Cys His Asp Phe Leu Gln Lys Lys Gly Tyr
130 135 140

Asp Ile Glu Leu Ala Tyr Glu Ala Gly Leu Leu Ser Arg Asn Glu Glu
145 150 155 160

Asn Phe Ser Tyr Tyr Asp Arg Phe Arg Asn Arg Ile Met Phe Pro Leu
165 170 175

Lys Asn Ala Gln Gly Arg Ile Val Gly Tyr Ser Gly Arg Thr Tyr Thr
180 185 190

Gly Gln Glu Pro Lys Tyr Leu Asn Ser Pro Glu Thr Pro Ile Phe Gln
195 200 205

Lys Arg Lys Leu Leu Tyr Asn Leu Asp Lys Ala Arg Lys Ser Ile Arg
210 215 220

Lys Leu Asp Glu Ile Val Leu Leu Glu Gly Phe Met Asp Val Ile Lys
225 230 235 240

Ser Asp Thr Ala Gly Leu Lys Asn Val Val Ala Thr Met Gly Thr Gln
245 250 255

Leu Ser Asp Glu His Ile Thr Phe Ile Arg Lys Leu Thr Ser Asn Ile
260 265 270

Thr Leu Met Phe Asp Gly Asp Phe Ala Gly Ser Glu Ala Thr Leu Lys
275 280 285

Thr Gly Gln His Leu Leu Gln Gln Gly Leu Asn Val Phe Val Ile Gln
290 295 300

Leu Pro Ser Gly Met Asp Pro Asp Glu Tyr Ile Gly Lys Tyr Gly Asn
305 310 315 320

Asp Ala Phe Thr Thr Phe Val Lys Asn Asp Lys Lys Ser Phe Ala His
325 330 335

Tyr Lys Val Ser Ile Leu Lys Asp Glu Ile Ala His Asn Asp Leu Ser
340 345 350

Tyr Glu Arg Tyr Leu Lys Glu Leu Ser His Asp Ile Ser Leu Met Lys
355 360 365

Ser Ser Ile Leu Gln Gln Lys Ala Ile Asn Asp Val Ala Pro Phe Phe
370 375 380

Asn Val Ser Pro Glu Gln Leu Ala Asn Glu Ile Gln Phe Asn Gln Ala
385 390 395 400

Pro Ala Asn Tyr Tyr Pro Glu Asp Glu Tyr Gly Gly Tyr Asp Glu Tyr
405 410 415

Gly Gly Tyr Ile Glu Pro Glu Pro Ile Gly Met Ala Gln Phe Asp Asn
420 425 430

Leu Ser Arg Arg Glu Lys Ala Glu Arg Ala Phe Leu Lys His Leu Met
435 440 445

Arg Asp Lys Asp Thr Phe Leu Asn Tyr Tyr Glu Ser Val Asp Lys Asp
450 455 460

Asn Phe Thr Asn Gln His Phe Lys Tyr Val Phe Glu Val Leu His Asp
465 470 475 480

Phe Tyr Ala Glu Asn Asp Gln Tyr Asn Ile Ser Asp Ala Val Gln Tyr
485 490 495

Val Asn Ser Asn Glu Leu Arg Glu Thr Leu Ile Ser Leu Glu Gln Tyr
500 505 510

Asn Leu Asn Gly Glu Pro Tyr Glu Asn Glu Ile Asp Asp Tyr Val Asn
515 520 525

Val Ile Asn Glu Lys Gly Gln Glu Thr Ile Glu Ser Leu Asn His Lys
530 535 540

Leu Arg Glu Ala Thr Arg Ile Gly Asp Val Glu Leu Gln Lys Tyr Tyr
545 550 555 560

Leu Gln Gln Ile Val Ala Lys Asn Lys Glu Arg Met
565 570



This invention also relates to the polC gene of Streptococcus pyogenes
encoding the α-large subunit. The partial nucleotide sequence of polC (α-large)
corresponds to SEQ. ID. No. 17 as follows:



atgtcagatt tattcgctaa attgatggac cagatagaaa tgccacttga catgagacgt 60
tcaagtgcct tttcatctgc tgatattatc gaggtaaagg tacattcggt gtcacgcttg 120
tgggaatttc attttgcctt tgcagcggtt ttaccgattg caacttatcg tgaattgcat 180
gatcgtttga taagaacttt tgaggcggct gacattaagg taacctttga catccaagct 240
gctcaggtgg attattcaga tgatctgctt caagcttatt accaagaagc ttttgagcat 300
gcaccgtgta atagtgctag ttttaaatct tctttctcaa agctcaaagt gacttatgag 360
gatgacaaac tcattattgc agcgccaggt tttgtgaata acgatcattt tagaaacaat 420
catctgccta atctggtcaa gcaattagaa gcctttggct ttggcatctt gaccatagat 480
atggtgtcag atcaggaaat gactgagcat ttgaccaaga attttgtttc cagtcgtcag 540
gctcttgtga aaaaggctgt gcaggataat ttggaagccc aaaaatctct tgaagccatg 600
atgccaccag ttgaggaagc cacacctgct cctaagtttg actacaagga acgagcagct 660
aagcgtcagg cagggtttga aaaagcaacc atcacaccaa tgattgagat tgagaccgaa 720
gaaaaccgga ttgtctttga gggtatggtt tttgacgtgg agcgtaaaac gactaggaca 780
ggtcgccata tcatcaactt taaaatgaca gactatacct cctcgtttgc tctccaaaaa 840
tgggctaaag acgatgagga gctccgtaaa tttgatatga ttgctaaggg agcttggtta 900
cgggtacaag ggaatattga gaccaatcct tttacgaaga gtctcaccat gaatgtccag 960
caggtcaaag aaattgtccg tcatgagcgc aaagacctga tgccagaagg gcaaaagcgg 1020
gtcgaacttc atgcccacac caatatgtct accatggatg ccttaccgac agtagaaagc 1080
ttgattgata cggcagccaa gtggggacac aaggcgattg ctatcaccga ccatgctaat 1140
gtgcaaagtt ttcctcatgg ctaccatagg gctcgcaaag ctgggattaa ggctattttt 1200
ggcctagaag ccaatattgt tgaggacaag gtgcctattt cttatgaacc tgttgatatg 1260
gatttgcacg aagccaccta tgtggtcttt gacgtggaaa ccacaggtct atctgctatg 1320
aataatgacc tgattcagat tgcggcttcc aaaatgttta aaggaaatat tgtagagcag 1380
tttgatgaat tcattgatcc tgggcatcct ctttcagcct ttaccaccga attgacagga 1440
attaccgata agcatttgca gggcgccaag ccattggtta ctgtcctaaa agcttttcag 1500
gacttttgca aagatagtat cttggttgcc cacaacgcca gttttgacgt gggctttatg 1560
aacgccaatt atgaacgcca cgacttgccc aaaatcacac agcctgtgat tgatacctta 1620
gaatttgcta gaaacttgta tcctgagtac aagcgtcacg gtttgggacc gctcaccaag 1680
cgtttccaag tgagtctaga ccaccatcat atggccaatt acgacgcgga agccacagga 1740
cgtcttttgt ttatttttct aaaagatgcc agagaaaagc atggcatcaa aaatcttttg 1800
caactcaata cagatttggt ggctgaggat tcttacaaaa aagcgcggat taagcatgcg 1860
actatctatg tgcaaaatca ggttggtctt aaaaatatgt ttaagttggt cagcctttcc 1920
aatatcaaat attttgaagg ggtgccgcgt attccaagaa ccgtcttaga tgctcacaga 1980
gagggtttgt tacttggtac agcttgttct gacggcgagg tttttgatgc cgttctgact 2040
aaaggaattg atgcagcggt tgatttggct aggtattatg attttatcga aatcatgcca 2100
ccagccattt accagccatt ggttgtccgt gaattaatca aagatcaagc aggtattgag 2160
caggtgattc gtgacctcat tgaagtaggg aaacgagcta agaaacctgt gcttgccact 2220
gggaatgtgc attatctaga gcctgaagaa gagatttacc gtgaaattat tgtgcgtagt 2280
cttggtcagg gtgccatgat taatagaaca atcggccgtg gggaaggggc acagcctgct 2340
cccctaccta aagcgcactt tagaacaacc aatgaaatgc tggatgagtt tgcctttctt 2400
ggaaaagacc tcgcttatca agtagttgtg caaaatactc aggattttgc ggaccgtatt 2460
gaggaagtgg aagtggttaa gggcgatctt tacaccccgt atattgataa ggccgaagag 2520
acggttgccg aattaaccta tcaaaaagcc tttgaaattt atggtaatcc tctcccagat 2580
attattgatt tacgcattga aaaagagtta acctctatct tggggaacgg ttttgctgtg 2640
atttatctcg cttcccaaat gcttgttaac cggtcaaatg agcgaggcta cctagttggt 2700
tctaggggat ctgtagggtc tagctttgtc gccaccatga ttgggattac tgaggttaat 2760
cctatgccgc ctcactacgt ttgcccgtcc tgccaacatt ctgaatttat cacagatggg 2820
tcagttggat ctggctatga tttgcctaat aaaccctgtc cgaaatgtgg caccccttat 2880
caaaaagatg ggcaagacat tccctttgag acctttcttg ggtttgatgg ggataaggtg 2940
cccgatattg atttgaactt ctctggtgat gaccagccca gtgcccattt ggatgtccga 3000
gatatttttg gtgacgaata cgcctttcgt gctggaacag ttggtaccgt agcagaaaaa 3060
acagcttatg gatttgtcaa aggctatgaa cgcgactatg gcaagttcta tcgtgatgct 3120
gaggtggatc gtctagcagc aggtgctgct ggtgtgaaac gaacgactgg gcagcaccct 3180
ggggggattg ttgttattcc taattacatg gatgtttatg attttacccc cgtgcaatat 3240
ccagccgatg atgtaacggc ttcttggcag acaactcact ttaacttcca tgatattgat 3300
gaaaacgtct tgaaacttga tatcctaggg catgatgatc cgaccatgat tcgtaaactt 3360
caggatttat cgggcattga tcctattact attcctgctg atgatccggg agttatggct 3420
ctcttttctg ggacagaggt tttgggcgtt accccggaac aaattgggac accgactggt 3480
atgctaggca ttccagaatt tggaaccaac tttgttcgcg gcatggttaa tgagacgcat 3540
ccgaccactt ttgcggagct tttgcagttg tctggactat ctcatggaac cgatgtttgg 3600
cttggtaatg cacaagattt gattaaagaa ggcattgcaa ccctaaaaac cgttatcggt 3660
tgtcgtgacg acatcatggt ttacctcatg cacgcaggct tagaaccaaa aatggccttt 3720
accattatgg agcgtgtgcg taagggatta tggctaaaaa tttctgagga agaacgtaat 3780
ggctatattg atgccatgcg agaaaacaat gtgcccgact ggtacattga atcgtgtgga 3840
aaaatcaagt acatgttccc taaagcccat gcggcagctt atgttttgat ggcccttcgg 3900
gtggcttatt tcaaggtgca ccaccccatt atgtattatt gtgcttattt ctctattcgt 3960
gcgaaggctt ttgaattaaa aaccatgagt ggtggtttag atgctgttaa agcaagaatg 4020
gaagatatta ctataaaacg taaaaataat gaagccacca atgtggaaaa tgacctcttt 4080
acaaccttgg agattgtcaa cgaaatgtta gaacgcggct ttaagtttgg caaattagac 4140
ctttacaaaa gtgatgctat agaattccaa atcaaaggag atacccttat ccctccattt 4200
atagcgctag aaggtctggg tgaaaacgtg gccaagcaaa tcgttaaagc tcgtcaagaa 4260
ggcgaattcc tctctaaaat ggaattgcgt aaacgaggcg gggcatcgtc aacgctcgtt 4320
gagaaaatgg atgagatggg tattttagga aatatgccag aagataatca attaagtctt 4380
tttgatgact ttttc 4395



The encoded α-large subunit has an amino acid sequence corresponding to SEQ. ID.
No. 18 as follows:



Met Ser Asp Leu Phe Ala Lys Leu Met Asp Gin lie Glu Met Pro Leu 
15 10 15 



Asp Met Arg Arg Ser Ser Ala Phe Ser Ser Ala Asp lie lie Glu Val 
60 20 25 30 
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Lys Val His Ser 
35 

Ala Val Leu Pro 
50 

Arg Thr Phe Glu 
65 

Ala Gin Val Asp 



Ala Phe Glu His 
100 

Ser Lys Leu Lys 
115 

Pro Gly Phe Val 
130 

Leu Val Lys Gin 
145 

Met Val Ser Asp 



Ser Ser Arg Gin 
180 

Ala Gin Lys Ser 
195 

Pro Ala Pro Lys 
210 

Gly Phe Glu Lys 
225 

Glu Asn Arg lie 



Thr Thr Arg Thr 
260 

Thr Ser Ser Phe 
275 

Arg Lys Phe Asp 
290 

Asn lie Glu Thr 
305 

Gin Val Lys Glu 



Gly Gin Lys Arg 
340 

Asp Ala Leu Pro 
355 



Val Ser Arg Leu 
40 

lie Ala Thr Tyr 
55 

Ala Ala Asp lie 
70 

Tyr Ser Asp Asp 
85 

Ala Pro Cys Asn 



Val Thr Tyr Glu 
120 

Asn Asn Asp His 
135 

Leu Glu Ala Phe 
150 

Gin Glu Met Thr 
165 

Ala Leu Val Lys 



Leu Glu Ala Met 
200 

Phe Asp Tyr Lys 
215 

Ala Thr lie Thr 
230 

Val Phe Glu Gly 
245 

Gly Arg His lie 



Ala Leu Gin Lys 
280 

Met lie Ala Lys 

295 

Asn Pro Phe Thr 
310 

lie Val Arg His 

325 

Val Glu Leu His 



Thr Val Glu Ser 
360 



Trp Glu Phe His 



Arg Glu Leu His 
60 

Lys Val Thr Phe 
75 

Leu Leu Gin Ala 
90 

Ser Ala Ser Phe 
105 

Asp Asp Lys Leu 



Phe Arg Asn Asn 
140 

Gly Phe Gly He 
155 

Glu His Leu Thr 
170 

Lys Ala Val Gin 
185 

Met Pro Pro Val 



Glu Arg Ala Ala 
220 

Pro Met He Glu 
235 

Met Val Phe Asp 
250 

He Asn Phe Lys 
265 

Trp Ala Lys Asp 



Gly Ala Trp Leu 
300 

Lys Ser Leu Thr 
315 

Glu Arg Lys Asp 
330 

Ala His Thr Asn 
345 

Leu He Asp Thr 



Phe Ala Phe Ala 
45 

Asp Arg Leu He 



Asp He Gin Ala 
80 

Tyr Tyr Gin Glu 
95 

Lys Ser Ser Phe 
110 

He He Ala Ala 
125 

His Leu Pro Asn 



Leu Thr He Asp 
160 

Lys Asn Phe Val 
175 

Asp Asn Leu Glu 
190 

Glu Glu Ala Thr 
205 

Lys Arg Gin Ala 



He Glu Thr Glu 
240 

Val Glu Arg Lys 
255 

Met Thr Asp Tyr 
270 

Asp Glu Glu Leu 
285 

Arg Val Gin Gly 



Met Asn Val Gin 
320 

Leu Met Pro Glu 
335 

Met Ser Thr Met 
350 

Ala Ala Lys Trp 
365 
Gly His Lys Ala Ile Ala Ile Thr Asp His Ala Asn Val Gln Ser Phe
370 375 380

Pro His Gly Tyr His Arg Ala Arg Lys Ala Gly Ile Lys Ala Ile Phe
385 390 395 400

Gly Leu Glu Ala Asn Ile Val Glu Asp Lys Val Pro Ile Ser Tyr Glu
405 410 415

Pro Val Asp Met Asp Leu His Glu Ala Thr Tyr Val Val Phe Asp Val
420 425 430

Glu Thr Thr Gly Leu Ser Ala Met Asn Asn Asp Leu Ile Gln Ile Ala
435 440 445

Ala Ser Lys Met Phe Lys Gly Asn Ile Val Glu Gln Phe Asp Glu Phe
450 455 460

Ile Asp Pro Gly His Pro Leu Ser Ala Phe Thr Thr Glu Leu Thr Gly
465 470 475 480

Ile Thr Asp Lys His Leu Gln Gly Ala Lys Pro Leu Val Thr Val Leu
485 490 495

Lys Ala Phe Gln Asp Phe Cys Lys Asp Ser Ile Leu Val Ala His Asn
500 505 510

Ala Ser Phe Asp Val Gly Phe Met Asn Ala Asn Tyr Glu Arg His Asp
515 520 525

Leu Pro Lys Ile Thr Gln Pro Val Ile Asp Thr Leu Glu Phe Ala Arg
530 535 540

Asn Leu Tyr Pro Glu Tyr Lys Arg His Gly Leu Gly Pro Leu Thr Lys
545 550 555 560

Arg Phe Gln Val Ser Leu Asp His His His Met Ala Asn Tyr Asp Ala
565 570 575

Glu Ala Thr Gly Arg Leu Leu Phe Ile Phe Leu Lys Asp Ala Arg Glu
580 585 590

Lys His Gly Ile Lys Asn Leu Leu Gln Leu Asn Thr Asp Leu Val Ala
595 600 605

Glu Asp Ser Tyr Lys Lys Ala Arg Ile Lys His Ala Thr Ile Tyr Val
610 615 620

Gln Asn Gln Val Gly Leu Lys Asn Met Phe Lys Leu Val Ser Leu Ser
625 630 635 640

Asn Ile Lys Tyr Phe Glu Gly Val Pro Arg Ile Pro Arg Thr Val Leu
645 650 655

Asp Ala His Arg Glu Gly Leu Leu Leu Gly Thr Ala Cys Ser Asp Gly
660 665 670

Glu Val Phe Asp Ala Val Leu Thr Lys Gly Ile Asp Ala Ala Val Asp
675 680 685

Leu Ala Arg Tyr Tyr Asp Phe Ile Glu Ile Met Pro Pro Ala Ile Tyr
690 695 700

Gln Pro Leu Val Val Arg Glu Leu Ile Lys Asp Gln Ala Gly Ile Glu
705 710 715 720

Gln Val Ile Arg Asp Leu Ile Glu Val Gly Lys Arg Ala Lys Lys Pro
725 730 735

Val Leu Ala Thr Gly Asn Val His Tyr Leu Glu Pro Glu Glu Glu Ile
740 745 750

Tyr Arg Glu Ile Ile Val Arg Ser Leu Gly Gln Gly Ala Met Ile Asn
755 760 765

Arg Thr Ile Gly Arg Gly Glu Gly Ala Gln Pro Ala Pro Leu Pro Lys
770 775 780

Ala His Phe Arg Thr Thr Asn Glu Met Leu Asp Glu Phe Ala Phe Leu
785 790 795 800

Gly Lys Asp Leu Ala Tyr Gln Val Val Val Gln Asn Thr Gln Asp Phe
805 810 815

Ala Asp Arg Ile Glu Glu Val Glu Val Val Lys Gly Asp Leu Tyr Thr
820 825 830

Pro Tyr Ile Asp Lys Ala Glu Glu Thr Val Ala Glu Leu Thr Tyr Gln
835 840 845

Lys Ala Phe Glu Ile Tyr Gly Asn Pro Leu Pro Asp Ile Ile Asp Leu
850 855 860

Arg Ile Glu Lys Glu Leu Thr Ser Ile Leu Gly Asn Gly Phe Ala Val
865 870 875 880

Ile Tyr Leu Ala Ser Gln Met Leu Val Asn Arg Ser Asn Glu Arg Gly
885 890 895

Tyr Leu Val Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr
900 905 910

Met Ile Gly Ile Thr Glu Val Asn Pro Met Pro Pro His Tyr Val Cys
915 920 925

Pro Ser Cys Gln His Ser Glu Phe Ile Thr Asp Gly Ser Val Gly Ser
930 935 940

Gly Tyr Asp Leu Pro Asn Lys Pro Cys Pro Lys Cys Gly Thr Pro Tyr
945 950 955 960

Gln Lys Asp Gly Gln Asp Ile Pro Phe Glu Thr Phe Leu Gly Phe Asp
965 970 975

Gly Asp Lys Val Pro Asp Ile Asp Leu Asn Phe Ser Gly Asp Asp Gln
980 985 990

Pro Ser Ala His Leu Asp Val Arg Asp Ile Phe Gly Asp Glu Tyr Ala
995 1000 1005

Phe Arg Ala Gly Thr Val Gly Thr Val Ala Glu Lys Thr Ala Tyr Gly
1010 1015 1020

Phe Val Lys Gly Tyr Glu Arg Asp Tyr Gly Lys Phe Tyr Arg Asp Ala
1025 1030 1035 1040

Glu Val Asp Arg Leu Ala Ala Gly Ala Ala Gly Val Lys Arg Thr Thr
1045 1050 1055

Gly Gln His Pro Gly Gly Ile Val Val Ile Pro Asn Tyr Met Asp Val
1060 1065 1070

Tyr Asp Phe Thr Pro Val Gln Tyr Pro Ala Asp Asp Val Thr Ala Ser
1075 1080 1085

Trp Gln Thr Thr His Phe Asn Phe His Asp Ile Asp Glu Asn Val Leu
1090 1095 1100

Lys Leu Asp Ile Leu Gly His Asp Asp Pro Thr Met Ile Arg Lys Leu
1105 1110 1115 1120

Gln Asp Leu Ser Gly Ile Asp Pro Ile Thr Ile Pro Ala Asp Asp Pro
1125 1130 1135

Gly Val Met Ala Leu Phe Ser Gly Thr Glu Val Leu Gly Val Thr Pro
1140 1145 1150

Glu Gln Ile Gly Thr Pro Thr Gly Met Leu Gly Ile Pro Glu Phe Gly
1155 1160 1165

Thr Asn Phe Val Arg Gly Met Val Asn Glu Thr His Pro Thr Thr Phe
1170 1175 1180

Ala Glu Leu Leu Gln Leu Ser Gly Leu Ser His Gly Thr Asp Val Trp
1185 1190 1195 1200

Leu Gly Asn Ala Gln Asp Leu Ile Lys Glu Gly Ile Ala Thr Leu Lys
1205 1210 1215

Thr Val Ile Gly Cys Arg Asp Asp Ile Met Val Tyr Leu Met His Ala
1220 1225 1230

Gly Leu Glu Pro Lys Met Ala Phe Thr Ile Met Glu Arg Val Arg Lys
1235 1240 1245

Gly Leu Trp Leu Lys Ile Ser Glu Glu Glu Arg Asn Gly Tyr Ile Asp
1250 1255 1260

Ala Met Arg Glu Asn Asn Val Pro Asp Trp Tyr Ile Glu Ser Cys Gly
1265 1270 1275 1280

Lys Ile Lys Tyr Met Phe Pro Lys Ala His Ala Ala Ala Tyr Val Leu
1285 1290 1295

Met Ala Leu Arg Val Ala Tyr Phe Lys Val His His Pro Ile Met Tyr
1300 1305 1310

Tyr Cys Ala Tyr Phe Ser Ile Arg Ala Lys Ala Phe Glu Leu Lys Thr
1315 1320 1325

Met Ser Gly Gly Leu Asp Ala Val Lys Ala Arg Met Glu Asp Ile Thr
1330 1335 1340

Ile Lys Arg Lys Asn Asn Glu Ala Thr Asn Val Glu Asn Asp Leu Phe
1345 1350 1355 1360

Thr Thr Leu Glu Ile Val Asn Glu Met Leu Glu Arg Gly Phe Lys Phe
1365 1370 1375

Gly Lys Leu Asp Leu Tyr Lys Ser Asp Ala Ile Glu Phe Gln Ile Lys
1380 1385 1390

Gly Asp Thr Leu Ile Pro Pro Phe Ile Ala Leu Glu Gly Leu Gly Glu
1395 1400 1405

Asn Val Ala Lys Gln Ile Val Lys Ala Arg Gln Glu Gly Glu Phe Leu
1410 1415 1420

Ser Lys Met Glu Leu Arg Lys Arg Gly Gly Ala Ser Ser Thr Leu Val
1425 1430 1435 1440

Glu Lys Met Asp Glu Met Gly Ile Leu Gly Asn Met Pro Glu Asp Asn
1445 1450 1455

Gln Leu Ser Leu Phe Asp Asp Phe Phe
1460 1465



The present invention also relates to the dnaE gene of Streptococcus
pyogenes encoding the α-small subunit. The partial nucleotide sequence of the dnaE
gene corresponds to SEQ. ID. No. 19 as follows:



atgtttgctc aacttgatac taaaactgta tactcattta tggatagttt aattgactta 60
aatcattatt ttgaacgagc aaagcaattt ggttaccaca ccataggaat catggataag 120
gataatcttt atggtgctta ccattttatt aaaggttgtc aaaaaaatgg actgcagcca 180
gttttaggtt tggaaataga gattctctat caagagcggc aggtgctcct taacttaatc 240
gcccagaata cacaaggcta tcatcagctt ttaaaaattt ccacggcaaa aatgtctggc 300
aagcttcata tggattactt ctgccaacat ttggaaggga tagcggttat tattcctagt 360
aagggttgga gcgatacatt agtggtccct tttgactact atatgggtgt tgatcagtat 420
actgatttat ctcatatgga ttctaagagg cagcttatac ccctaaggac agttcgttat 480
tttgcgcaag atgatatgga aaccctgcac atgttgcatg ccattcgaga taacctcagt 540
ctggcagaga cccctgtggt agaaagtgat caagagttag cagattgtca acaactaacc 600
gccttctatc aaacacactg ccctcaagct ctacagaatt tagaagactt agtgtcagga 660
atctattatg atttcgatac aaatttaaaa ttgcctcatt ttaatagaga taagtctgcc 720
aagcaagaat tgcaagactt gactgaggct ggtttgaagg aaaaaggatt gtggaaagag 780
ccttatcaat cgcgcttact acatgaattg gtcattattt ctgacatggg ctttgatgat 840
tattttttga ttgtgtggga tttacttcgc tttggacgca gtaaaggcta ttatatggga 900
atgggacgtg gctcggcggc aggtagtcta gtggcttatg ctctgaacat tacagggatt 960
gatccagttc aacatgattt gctatttgag cgctttttaa acaaagaacg ttatagcatg 1020
cctgatattg atatcgatct tccagatatt taccgttcag aatttctacg gtatgtccga 1080
aatcgttatg gtagcgacca ttcggcgcaa attgtgacct tttcaacctt tggccaggct 1140
attcgtgatg ttttcaaacg gttcggggtt ccagaatacg aactgactaa tctcactaaa 1200
aaaattggtt ttaaagatag cttggctact gtctatgaaa agtcaatctc ttttaggcag 1260
gttattaata gtagaactga atttcaaaag gcttttgcca ttgccaagcg tatcgaagga 1320
aatccaagac aaacgtccat tcacgcagct ggtattgtga tgagtgatga tgccttgacc 1380
aatcatattc ctctaaaatc gggcgatgac atgatgatca cccagtatga tgctcatgcg 1440
gtcgaagcta atggcctgtt aaaaatggat tttttggggt taagaaattt gacctttgtt 1500
caaaaaatgc aagagaaggt tgctaaagac tacgggtgtc agattgatat tacagccatt 1560
gatttagaag acccgcaaac gttggcactt tttgctaaag gggataccaa gggaattttc 1620
caatttgaac aaaatggtgc tattaatctt ttaaaacgga ttaagccaca acgttttgaa 1680
gaaattgttg ccactaccag tctaaataga ccaggggcaa gtgactatac cactaatttc 1740
attaaacgaa gagaaggaca agaaaaaatt gatttgattg atcctgtgat tgctcccatt 1800
ttagagccaa cttacggtat tatgctttat caagaacaag ttatgcagat tgcacaggtt 1860
tatgctggtt ttacgttagg caaggccgac ttgttaaggc gtgccatgtc taaaaaaaat 1920
ctacaagaaa tgcaaaaaat ggaagaagac tttattgctt ctgctaagca cctagggaga 1980
gctgaagaaa cagctagagg actttttaaa cggatggaaa aatttgcagg ttatggtttt 2040
aaccgcagcc atgcctttgc ctattcagct ttagcttttc aattggctta tttcaaagcc 2100
cattacccgg ctgtttttta cgatatcatg atgaattatt ctagcagtga ctatatcaca 2160
gatgctctag aatcagattt tcaagtagcg caagttacca ttaatagtat tccttacact 2220
gataaaattg aagctagcaa gatttacatg gggctgaaaa atattaaggg gttgccaagg 2280
gattttgctt attggattat cgagcaaaga ccatttaata gcgtagagga ttttctcact 2340
agaactccag aaaaatatca aaaaaaggtt ttccttgagc ctctgataaa aataggtctg 2400
tttgattgct ttgagcctaa ccgtaaaaaa attctggaca atttggatgg tttactggta 2460
tttgttaatg agcttggttc tcttttttca gattcttcct ttagttgggt agatacgaaa 2520
gattactcag taactgaaaa atattctttg gaacaggaga tcgttggagt tggcatgagc 2580
aagcatcctt taattgatat tgctgagaaa agtacccaaa cttttactcc tatttcacag 2640
ttagtcaaag aaagcgaagc agtcgtactg attcaaatag atagcattag gatcattaga 2700
accaaaacaa gtgggcagca aatggctttt ttaagtgtga atgacactaa gaaaaagctc 2760
gatgtcacac tttttccaca agagtatgcc atttataaag accaattaaa agaaggagaa 2820
ttctattact taaaaggtag aataaaagaa agagaccatc gactgcagat ggtgtgtcag 2880
caagtgcaaa tggctattag tcaaaaatat tggttattag ttgaaaacca tcagtttgat 2940
tcccaaattt ctgagatttt aggtgccttt ccaggaacga ctccagttgt tattcactat 3000
caaaaaaata aggaaacaat tgcattaact aagattcagg ttcatgtaac agagaattta 3060
aaggaaaaac ttcgtccttt tgttctgaaa acggtttttc ga 3102



The encoded α-small subunit has an amino acid sequence corresponding to SEQ. ID.
No. 20 as follows:



Met Phe Ala Gln Leu Asp Thr Lys Thr Val Tyr Ser Phe Met Asp Ser
1 5 10 15

Leu Ile Asp Leu Asn His Tyr Phe Glu Arg Ala Lys Gln Phe Gly Tyr
20 25 30

His Thr Ile Gly Ile Met Asp Lys Asp Asn Leu Tyr Gly Ala Tyr His
35 40 45

Phe Ile Lys Gly Cys Gln Lys Asn Gly Leu Gln Pro Val Leu Gly Leu
50 55 60

Glu Ile Glu Ile Leu Tyr Gln Glu Arg Gln Val Leu Leu Asn Leu Ile
65 70 75 80

Ala Gln Asn Thr Gln Gly Tyr His Gln Leu Leu Lys Ile Ser Thr Ala
85 90 95

Lys Met Ser Gly Lys Leu His Met Asp Tyr Phe Cys Gln His Leu Glu
100 105 110

Gly Ile Ala Val Ile Ile Pro Ser Lys Gly Trp Ser Asp Thr Leu Val
115 120 125

Val Pro Phe Asp Tyr Tyr Met Gly Val Asp Gln Tyr Thr Asp Leu Ser
130 135 140

His Met Asp Ser Lys Arg Gln Leu Ile Pro Leu Arg Thr Val Arg Tyr
145 150 155 160

Phe Ala Gln Asp Asp Met Glu Thr Leu His Met Leu His Ala Ile Arg
165 170 175

Asp Asn Leu Ser Leu Ala Glu Thr Pro Val Val Glu Ser Asp Gln Glu
180 185 190

Leu Ala Asp Cys Gln Gln Leu Thr Ala Phe Tyr Gln Thr His Cys Pro
195 200 205

Gln Ala Leu Gln Asn Leu Glu Asp Leu Val Ser Gly Ile Tyr Tyr Asp
210 215 220

Phe Asp Thr Asn Leu Lys Leu Pro His Phe Asn Arg Asp Lys Ser Ala
225 230 235 240

Lys Gln Glu Leu Gln Asp Leu Thr Glu Ala Gly Leu Lys Glu Lys Gly
245 250 255

Leu Trp Lys Glu Pro Tyr Gln Ser Arg Leu Leu His Glu Leu Val Ile
260 265 270

Ile Ser Asp Met Gly Phe Asp Asp Tyr Phe Leu Ile Val Trp Asp Leu
275 280 285

Leu Arg Phe Gly Arg Ser Lys Gly Tyr Tyr Met Gly Met Gly Arg Gly
290 295 300

Ser Ala Ala Gly Ser Leu Val Ala Tyr Ala Leu Asn Ile Thr Gly Ile
305 310 315 320

Asp Pro Val Gln His Asp Leu Leu Phe Glu Arg Phe Leu Asn Lys Glu
325 330 335

Arg Tyr Ser Met Pro Asp Ile Asp Ile Asp Leu Pro Asp Ile Tyr Arg
340 345 350

Ser Glu Phe Leu Arg Tyr Val Arg Asn Arg Tyr Gly Ser Asp His Ser
355 360 365

Ala Gln Ile Val Thr Phe Ser Thr Phe Gly Pro Lys Gln Ala Ile Arg
370 375 380

Asp Val Phe Lys Arg Phe Gly Val Pro Glu Tyr Glu Leu Thr Asn Leu
385 390 395 400

Thr Lys Lys Ile Gly Phe Lys Asp Ser Leu Ala Thr Val Tyr Glu Lys
405 410 415

Ser Ile Ser Phe Arg Gln Val Ile Asn Ser Arg Thr Glu Phe Gln Lys
420 425 430

Ala Phe Ala Ile Ala Lys Arg Ile Glu Gly Asn Pro Arg Gln Thr Ser
435 440 445

Ile His Ala Ala Gly Ile Val Met Ser Asp Asp Ala Leu Thr Asn His
450 455 460

Ile Pro Leu Lys Ser Gly Asp Asp Met Met Ile Thr Gln Tyr Asp Ala
465 470 475 480

His Ala Val Glu Ala Asn Gly Leu Leu Lys Met Asp Phe Leu Gly Leu
485 490 495

Arg Asn Leu Thr Phe Val Gln Lys Met Gln Glu Lys Val Ala Lys Asp
500 505 510

Tyr Gly Cys Gln Ile Asp Ile Thr Ala Ile Asp Leu Glu Asp Pro Gln
515 520 525

Thr Leu Ala Leu Phe Ala Lys Gly Asp Thr Lys Gly Ile Phe Gln Phe
530 535 540

Glu Gln Asn Gly Ala Ile Asn Leu Leu Lys Arg Ile Lys Pro Gln Arg
545 550 555 560

Phe Glu Glu Ile Val Ala Thr Thr Ser Leu Asn Arg Pro Gly Ala Ser
565 570 575

Asp Tyr Thr Thr Asn Phe Ile Lys Arg Arg Glu Gly Gln Glu Lys Ile
580 585 590

Asp Leu Ile Asp Pro Val Ile Ala Pro Ile Leu Glu Pro Thr Tyr Gly
595 600 605

Ile Met Leu Tyr Gln Glu Gln Val Met Gln Ile Ala Gln Val Tyr Ala
610 615 620

Gly Phe Thr Leu Gly Lys Ala Asp Leu Leu Arg Arg Ala Met Ser Lys
625 630 635 640

Lys Asn Leu Gln Glu Met Gln Lys Met Glu Glu Asp Phe Ile Ala Ser
645 650 655

Ala Lys His Leu Gly Arg Ala Glu Glu Thr Ala Arg Gly Leu Phe Lys
660 665 670

Arg Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser His Ala Phe
675 680 685

Ala Tyr Ser Ala Leu Ala Phe Gln Leu Ala Tyr Phe Lys Ala His Tyr
690 695 700

Pro Ala Val Phe Tyr Asp Ile Met Met Asn Tyr Ser Ser Ser Asp Tyr
705 710 715 720

Ile Thr Asp Ala Leu Glu Ser Asp Phe Gln Val Ala Gln Val Thr Ile
725 730 735

Asn Ser Ile Pro Tyr Thr Asp Lys Ile Glu Ala Ser Lys Ile Tyr Met
740 745 750

Gly Leu Lys Asn Ile Lys Gly Leu Pro Arg Asp Phe Ala Tyr Trp Ile
755 760 765

Ile Glu Gln Arg Pro Phe Asn Ser Val Glu Asp Phe Leu Thr Arg Thr
770 775 780

Pro Glu Lys Tyr Gln Lys Lys Val Phe Leu Glu Pro Leu Ile Lys Ile
785 790 795 800

Gly Leu Phe Asp Cys Phe Glu Pro Asn Arg Lys Lys Ile Leu Asp Asn
805 810 815

Leu Asp Gly Leu Leu Val Phe Val Asn Glu Leu Gly Ser Leu Phe Ser
820 825 830

Asp Ser Ser Phe Ser Trp Val Asp Thr Lys Asp Tyr Ser Val Thr Glu
835 840 845

Lys Tyr Ser Leu Glu Gln Glu Ile Val Gly Val Gly Met Ser Lys His
850 855 860

Pro Leu Ile Asp Ile Ala Glu Lys Ser Thr Gln Thr Phe Thr Pro Ile
865 870 875 880

Ser Gln Leu Val Lys Glu Ser Glu Ala Val Val Leu Ile Gln Ile Asp
885 890 895

Ser Ile Arg Ile Ile Arg Thr Lys Thr Ser Gly Gln Gln Met Ala Phe
900 905 910

Leu Ser Val Asn Asp Thr Lys Lys Lys Leu Asp Val Thr Leu Phe Pro
915 920 925

Gln Glu Tyr Ala Ile Tyr Lys Asp Gln Leu Lys Glu Gly Glu Phe Tyr
930 935 940

Tyr Leu Lys Gly Arg Ile Lys Glu Arg Asp His Arg Leu Gln Met Val
945 950 955 960

Cys Gln Gln Val Gln Met Ala Ile Ser Gln Lys Tyr Trp Leu Leu Val
965 970 975

Glu Asn His Gln Phe Asp Ser Gln Ile Ser Glu Ile Leu Gly Ala Phe
980 985 990

Pro Gly Thr Thr Pro Val Val Ile His Tyr Gln Lys Asn Lys Glu Thr
995 1000 1005

Ile Ala Leu Thr Lys Ile Gln Val Thr Glu Asn Leu Lys Glu Lys Leu
1010 1015 1020

Arg Pro Phe Val Leu Lys Thr Val Phe Arg
1025 1030

The present invention also relates to the holA gene of Streptococcus
pyogenes encoding the δ subunit. The holA gene has a nucleotide sequence which
corresponds to SEQ. ID. No. 21 as follows:



atgattgcga tagaaaagat tgaaaaactg agtaaagaaa atttgggtct tataaccctt 60
gtcacaggag atgacattgg tcagtatagc cagttgaaat cccgcttaat ggagcagatt 120
gcttttgata aggatgattt ggcctattct tactttgata tgtctgaggc cgcttatcag 180
gatgcagaaa tggatctagt gagcctaccc ttctttgctg agcagaaggt ggttattttt 240
gaccatttgt tagatatcac gaccaataaa aaaagtttct taaaagaaaa agacctaaag 300
gcctttgaag cctatttaga aaatccctta gagactactc gactaattat ctttgctcca 360
ggtaaattgg atagtaagag acggcttgtt aagcttttga aacgtgatgc ccttgtttta 420
gaagccaacc ctctgaaaga agcagagcta agaacttatt ttcaaaaata cagtcatcaa 480
ctgggtttag gtttcgagag tggtgccttt gaccaattac ttttgaaatc aaacgatgat 540
tttagtcaaa tcatgaaaaa catggccttt ttaaaagcct ataaaaaaac gggaaatatt 600
agcctaactg atattgagca agccattcct aaaagtttac aagataatat tttcgatctg 660
actagacttg tcctaggagg taaaattgat gcggctagag atttgattca tgatttacgg 720
ttatctggag aagatgacat taaattaatc gctatcatgc taggccaatt tcgcttattt 780
ttgcagctga ctattcttgc tagagatgta aaaaacgagc aacaactagt gattagttta 840
tcagatattc ttgggcggcg ggttaatcct taccaggtca agtatgcgtt aaaggattct 900
aggaccttat ctcttgcctt tctaacagga gcggtgaaaa ccttgattga gacagattac 960
cagataaaaa caggacttta tgagaagagt tatctagttg atattgctct cttaaaaatc 1020
atgactcact ctcaaaaa 1038



The encoded δ subunit has an amino acid sequence corresponding to SEQ. ID. No. 22
as follows:



Met Ile Ala Ile Glu Lys Ile Glu Lys Leu Ser Lys Glu Asn Leu Gly
1 5 10 15

Leu Ile Thr Leu Val Thr Gly Asp Asp Ile Gly Gln Tyr Ser Gln Leu
20 25 30

Lys Ser Arg Leu Met Glu Gln Ile Ala Phe Asp Lys Asp Asp Leu Ala
35 40 45

Tyr Ser Tyr Phe Asp Met Ser Glu Ala Ala Tyr Gln Asp Ala Glu Met
50 55 60

Asp Leu Val Ser Leu Pro Phe Phe Ala Glu Gln Lys Val Val Ile Phe
65 70 75 80

Asp His Leu Leu Asp Ile Thr Thr Asn Lys Lys Ser Phe Leu Lys Glu
85 90 95

Lys Asp Leu Lys Ala Phe Glu Ala Tyr Leu Glu Asn Pro Leu Glu Thr
100 105 110

Thr Arg Leu Ile Ile Phe Ala Pro Gly Lys Leu Asp Ser Lys Arg Arg
115 120 125

Leu Val Lys Leu Leu Lys Arg Asp Ala Leu Val Leu Glu Ala Asn Pro
130 135 140

Leu Lys Glu Ala Glu Leu Arg Thr Tyr Phe Gln Lys Tyr Ser His Gln
145 150 155 160

Leu Gly Leu Gly Phe Glu Ser Gly Ala Phe Asp Gln Leu Leu Leu Lys
165 170 175

Ser Asn Asp Asp Phe Ser Gln Ile Met Lys Asn Met Ala Phe Leu Lys
180 185 190

Ala Tyr Lys Lys Thr Gly Asn Ile Ser Leu Thr Asp Ile Glu Gln Ala
195 200 205

Ile Pro Lys Ser Leu Gln Asp Asn Ile Phe Asp Leu Thr Arg Leu Val
210 215 220

Leu Gly Gly Lys Ile Asp Ala Ala Arg Asp Leu Ile His Asp Leu Arg
225 230 235 240

Leu Ser Gly Glu Asp Asp Ile Lys Leu Ile Ala Ile Met Leu Gly Gln
245 250 255

Phe Arg Leu Phe Leu Gln Leu Thr Ile Leu Ala Arg Asp Val Lys Asn
260 265 270

Glu Gln Gln Leu Val Ile Ser Leu Ser Asp Ile Leu Gly Arg Arg Val
275 280 285

Asn Pro Tyr Gln Val Lys Tyr Ala Leu Lys Asp Ser Arg Thr Leu Ser
290 295 300

Leu Ala Phe Leu Thr Gly Ala Val Lys Thr Leu Ile Glu Thr Asp Tyr
305 310 315 320

Gln Ile Lys Thr Gly Leu Tyr Glu Lys Ser Tyr Leu Val Asp Ile Ala
325 330 335

Leu Leu Lys Ile Met Thr His Ser Gln Lys
340 345

The present invention also relates to the holB gene of Streptococcus
pyogenes encoding the δ' subunit. The holB gene has a nucleotide sequence which
corresponds to SEQ. ID. No. 23 as follows:



atggatttag cgcaaaaagc tcctaacgtt tatcaagctt ttcagacaat tttaaagaaa 60
gaccgtctga atcatgctta tcttttttcg ggtgattttg ctaatgaaga aatggctctt 120
tttttagcta aggtcatctt ttgtgaacag aaaaaggatc agacgccctg cgggcattgt 180
cgctcttgtc aattgattga acaaggagat tttgccgatg tgacggtatt ggaaccaaca 240
gggcaagtga ttaaaacgga tgtggtcaaa gaaatgatgg ctaacttttc tcagacagga 300
tatgaaaaca aacgacaagt ttttattatc aaagattgtg acaaaatgca tatcaatgcc 360
gctaatagct tgctaaaata cattgaggag cctcagggag aagcttacat atttttattg 420
accaatgatg ataacaaagt gcttccgacc attaaaagtc ggacacaggt ttttcagttt 480
cctaaaaacg aagcctatct ttaccaattg gcacaagaaa agggattatt aaaccatcag 540
gctaagctag tagccaaact tgccacaaac accagtcatc tagaacgtct gttgcaaacg 600
agcaagcttt tagaactgat aactcaagca gagcgttttg tatctatttg gctgaaagat 660
cagttgcagg catatttagc gttgaaccgt ctggtacagt tagcaactga aaaagaagaa 720
caagatttag ttttgaccct tttgaccttg ctcttggcaa gagagcgtgc gcaaacgcct 780
ttgacacaat tggaggctgt ctatcaggct aggctcatgt ggcagagcaa tgttaatttt 840
caaaacacat tagaatatat ggtgatgtca gaa 873



The encoded δ' subunit has an amino acid sequence corresponding to SEQ. ID. No. 24
as follows:

Met Asp Leu Ala Gln Lys Ala Pro Asn Val Tyr Gln Ala Phe Gln Thr
1 5 10 15

Ile Leu Lys Lys Asp Arg Leu Asn His Ala Tyr Leu Phe Ser Gly Asp
20 25 30

Phe Ala Asn Glu Glu Met Ala Leu Phe Leu Ala Lys Val Ile Phe Cys
35 40 45

Glu Gln Lys Lys Asp Gln Thr Pro Cys Gly His Cys Arg Ser Cys Gln
50 55 60

Leu Ile Glu Gln Gly Asp Phe Ala Asp Val Thr Val Leu Glu Pro Thr
65 70 75 80

Gly Gln Val Ile Lys Thr Asp Val Val Lys Glu Met Met Ala Asn Phe
85 90 95

Ser Gln Thr Gly Tyr Glu Asn Lys Arg Gln Val Phe Ile Ile Lys Asp
100 105 110

Cys Asp Lys Met His Ile Asn Ala Ala Asn Ser Leu Leu Lys Tyr Ile
115 120 125

Glu Glu Pro Gln Gly Glu Ala Tyr Ile Phe Leu Leu Thr Asn Asp Asp
130 135 140

Asn Lys Val Leu Pro Thr Ile Lys Ser Arg Thr Gln Val Phe Gln Phe
145 150 155 160

Pro Lys Asn Glu Ala Tyr Leu Tyr Gln Leu Ala Gln Glu Lys Gly Leu
165 170 175

Leu Asn His Gln Ala Lys Leu Val Ala Lys Leu Ala Thr Asn Thr Ser
180 185 190

His Leu Glu Arg Leu Leu Gln Thr Ser Lys Leu Leu Glu Leu Ile Thr
195 200 205

Gln Ala Glu Arg Phe Val Ser Ile Trp Leu Lys Asp Gln Leu Gln Ala
210 215 220

Tyr Leu Ala Leu Asn Arg Leu Val Gln Leu Ala Thr Glu Lys Glu Glu
225 230 235 240

Gln Asp Leu Val Leu Thr Leu Leu Thr Leu Leu Leu Ala Arg Glu Arg
245 250 255

Ala Gln Thr Pro Leu Thr Gln Leu Glu Ala Val Tyr Gln Ala Arg Leu
260 265 270

Met Trp Gln Ser Asn Val Asn Phe Gln Asn Thr Leu Glu Tyr Met Val
275 280 285

Met Ser Glu
290



The present invention also relates to the dnaX gene of Streptococcus
pyogenes encoding the τ subunit. The dnaX gene has a nucleotide sequence which
corresponds to SEQ. ID. No. 25 as follows:



atgtatcaag ctctttatcg gaaataccgg agccaaacgt ttgacgaaat ggtgggacaa 60
tcggttattt ccacaacttt aaagcaggca gttgaatctg gcaagattag ccatgcttat 120
cttttttcag gtcctagagg gactgggaaa acaagtgcgg caaagatttt tgcaaaggcc 180
atgaattgtc ctaaccaagt cgatggtgaa ccctgtaatc aatgcgatat ttgccgagat 240
atcacgaatg gaagcttgga agatgtgatt gaaattgatg ctgcctcgaa taatggtgtt 300
gatgaaattc gtgacattcg agacaaatca acctatgcgc caagtcgtgc gacttacaag 360
gtttatatta ttgatgaggt tcacatgtta tcaacagggg cttttaatgc gcttttgaaa 420
actttggaag aaccgacaga atgttgtctt tatcttggca acaacggaat gcataaaatt 480
ccagccacta ttttatctcg tgtgcaacgc tttgaattca aagctattaa gcaaaaagct 540
attcgagagc atttagcctg ggttttggac aaagaaggta ttgcctatga ggtggatgct 600
ttaaatctca ttgcaaggcg agcagaagga ggcatgcgtg atgctttatc tattttagat 660
caggctttga gcttgtcacc agataatcag gtcgccattg caattgccga agaaattaca 720
ggttctattt ccatacttgc tctgggtgac tatgttcgat atgtctccca agaacaggct 780
acgcaagctc tggcagcctt agagaccatt tatgatagtg ggaagagcat gagccgcttt 840
gcgacagatt tattgaccta tctgcgtgat ttattggtgg ttaaagctgg cggcgacaat 900
caacgtcagt cagctgtttt tgataccaat ttgtctctct cgatagatcg tatattccaa 960
atgataacag ttgttactag tcatctccct gaaatcaaaa agggaaccca tcctcggatt 1020
tatgccgaaa tgatgactat ccaattagct cagaaagagc agattttgtc ccaagtaaac 1080
ttgtcaggag agttaatctc agagattgaa acgctcaaaa atgagttggc acaacttaaa 1140
caacaattgt cgcagctcca atcgcgtcct gattcactgg caagatctga taaaacgaaa 1200
cctaaaacca caagctacag ggttgatcgg gttaccattt tgaaaatcat ggaagaaacg 1260
gttcgaaata gccaacaatc tcgacaatat ctagatgctc taaaaaatgc ttggaatgaa 1320
attctagata acatttctgc ccaagacaga gccttattga tgggctctga gcctgtctta 1380
gcaaatagtg agaatgcgat tttggctttc gaggctgcct ttaatgcaga acaagtcatg 1440
agccgaaata atcttaatga tatgtttggt aacattatga gtaaagctgc tggtttttct 1500
cccaatattc tggcagtacc aaggacagat tttcagcata ttcgtaagga atttgctcag 1560
caaatgaaat cgcaaaaaga cagtgttcaa gaagaacaag aagtagcgct tgatattcca 1620
gaagggtttg attttttgct cgataaaata aatactattg acgac 1665



The encoded τ subunit has an amino acid sequence corresponding to SEQ. ID. No. 26
as follows:

Met Tyr Gln Ala Leu Tyr Arg Lys Tyr Arg Ser Gln Thr Phe Asp Glu
1 5 10 15

Met Val Gly Gln Ser Val Ile Ser Thr Thr Leu Lys Gln Ala Val Glu
20 25 30

Ser Gly Lys Ile Ser His Ala Tyr Leu Phe Ser Gly Pro Arg Gly Thr
35 40 45

Gly Lys Thr Ser Ala Ala Lys Ile Phe Ala Lys Ala Met Asn Cys Pro
50 55 60

Asn Gln Val Asp Gly Glu Pro Cys Asn Gln Cys Asp Ile Cys Arg Asp
65 70 75 80

Ile Thr Asn Gly Ser Leu Glu Asp Val Ile Glu Ile Asp Ala Ala Ser
85 90 95

Asn Asn Gly Val Asp Glu Ile Arg Asp Ile Arg Asp Lys Ser Thr Tyr
100 105 110

Ala Pro Ser Arg Ala Thr Tyr Lys Val Tyr Ile Ile Asp Glu Val His
115 120 125

Met Leu Ser Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu
130 135 140

Pro Thr Glu Asn Val Phe Ile Leu Ala Thr Thr Glu Leu His Lys Ile
145 150 155 160

Pro Ala Thr Ile Leu Ser Arg Val Gln Arg Phe Glu Phe Lys Ala Ile
165 170 175

Lys Gln Lys Ala Ile Arg Glu His Leu Ala Trp Val Leu Asp Lys Glu
180 185 190

Gly Ile Ala Tyr Glu Val Asp Ala Leu Asn Leu Ile Ala Arg Arg Ala
195 200 205

Glu Gly Gly Met Arg Asp Ala Leu Ser Ile Leu Asp Gln Ala Leu Ser
210 215 220

Leu Ser Pro Asp Asn Gln Val Ala Ile Ala Ile Ala Glu Glu Ile Thr
225 230 235 240

Gly Ser Ile Ser Ile Leu Ala Leu Gly Asp Tyr Val Arg Tyr Val Ser
245 250 255

Gln Glu Gln Ala Thr Gln Ala Leu Ala Ala Leu Glu Thr Ile Tyr Asp
260 265 270

Ser Gly Lys Ser Met Ser Arg Phe Ala Thr Asp Leu Leu Thr Tyr Leu
275 280 285

Arg Asp Leu Leu Val Val Lys Ala Gly Gly Asp Asn Gln Arg Gln Ser
290 295 300

Ala Val Phe Asp Thr Asn Leu Ser Leu Ser Ile Asp Arg Ile Phe Gln
305 310 315 320

Met Ile Thr Val Val Thr Ser His Leu Pro Glu Ile Lys Lys Gly Thr
325 330 335

His Pro Arg Ile Tyr Ala Glu Met Met Thr Ile Gln Leu Ala Gln Lys
340 345 350

Glu Gln Ile Leu Ser Gln Val Asn Leu Ser Gly Glu Leu Ile Ser Glu
355 360 365

Ile Glu Thr Leu Lys Asn Glu Leu Ala Gln Leu Lys Gln Gln Leu Ser
370 375 380

Gln Leu Gln Ser Arg Pro Asp Ser Leu Ala Arg Ser Asp Lys Thr Lys
385 390 395 400

Pro Lys Thr Thr Ser Tyr Arg Val Asp Arg Val Thr Ile Leu Lys Ile
405 410 415

Met Glu Glu Thr Val Arg Asn Ser Gln Gln Ser Arg Gln Tyr Leu Asp
420 425 430

Ala Leu Lys Asn Ala Trp Asn Glu Ile Leu Asp Asn Ile Ser Ala Gln
435 440 445

Asp Arg Ala Leu Leu Met Gly Ser Glu Pro Val Leu Ala Asn Ser Glu
450 455 460

Asn Ala Ile Leu Ala Phe Glu Ala Ala Phe Asn Ala Glu Gln Val Met
465 470 475 480

Ser Arg Asn Asn Leu Asn Asp Met Phe Gly Asn Ile Met Ser Lys Ala
485 490 495

Ala Gly Phe Ser Pro Asn Ile Leu Ala Val Pro Arg Thr Asp Phe Gln
500 505 510

His Ile Arg Lys Glu Phe Ala Gln Gln Met Lys Ser Gln Lys Asp Ser
515 520 525

Val Gln Glu Glu Gln Glu Val Ala Leu Asp Ile Pro Glu Gly Phe Asp
530 535 540

Phe Leu Leu Asp Lys Ile Asn Thr Ile Asp Asp
545 550 555

The present invention also relates to the dnaN gene of Streptococcus
pyogenes encoding the β subunit. The dnaN gene has a nucleotide sequence which
corresponds to SEQ. ID. No. 27 as follows:



atgattcaat tttcaattaa tcgcacatta tttattcatg ctttaaatac aactaaacgt 60
gctattagca ctaaaaatgc cattcctatt ctttcatcaa taaaaattga agtcacttct 120
acaggagtaa ctttaacagg gtctaacggt caaatatcaa ttgaaaacac tattcctgta 180
agtaatgaaa atgctggttt gctaattacc tctccaggag ctattttatt agaagctagt 240
ttttttatta atattatttc aagtttgcca gatattagta taaatgttaa agaaattgaa 300
caacaccaag ttgttttaac cagtggtaaa tcagagatta ccttaaaagg aaaagatgtt 360
gaccagtatc ctcgtctaca agaagtatca acagaaaatc ctttgatttt aaaaacaaaa 420
ttattgaagt ctattattgc tgaaacagct tttgcagcca gtttacaaga aagtcgtcct 480
attttaacag gagttcatat tgtattaagt aatcataaag attttaaagc agtagcgact 540
gactctcatc gtatgagcca acgtttaatc actttggaca atacttcagc agatttgatg 600
gtagttcttc caagtaaatc tttgagagaa ttttcagcag tatttacaga tgatattgag 660
accgttgagg tatttttctc accaagccaa atcttgttca gaagtgaaca catttctttt 720
tatacacgcc tcttagaagg aaattatccc gatacagacc gtttattaat gacagaattt 780
gagacggagg ttgttttcaa tacccaatcc cttcgccacg ctatggaacg tgccttcttg 840
atttctaatg ctactcaaaa tggtactgtt aagcttgaga ttactcaaaa tcatatttca 900
gctcatgtta actcacctga ggttggtaag gtaaacgagg atttagatat tgttagtcag 960
tctggtagtg atttaactat cagcttcaat ccaacttacc ttattgagtc tttaaaagct 1020
attaaaagtg aaacagtaaa aattcatttc ttatcaccag ttcgaccatt caccctaaca 1080
ccaggcgatg aggaagaaag ttttatccaa ttaattacac cagtacgaac aaac 1134

The encoded β subunit has an amino acid sequence corresponding to SEQ. ID. No. 28
as follows:



Met Ile Gln Phe Ser Ile Asn Arg Thr Leu Phe Ile His Ala Leu Asn
1 5 10 15

Thr Thr Lys Arg Ala Ile Ser Thr Lys Asn Ala Ile Pro Ile Leu Ser
20 25 30

Ser Ile Lys Ile Glu Val Thr Ser Thr Gly Val Thr Leu Thr Gly Ser
35 40 45

Asn Gly Gln Ile Ser Ile Glu Asn Thr Ile Pro Val Ser Asn Glu Asn
50 55 60

Ala Gly Leu Leu Ile Thr Ser Pro Gly Ala Ile Leu Leu Glu Ala Ser
65 70 75 80

Phe Phe Ile Asn Ile Ile Ser Ser Leu Pro Asp Ile Ser Ile Asn Val
85 90 95

Lys Glu Ile Glu Gln His Gln Val Val Leu Thr Ser Gly Lys Ser Glu
100 105 110

Ile Thr Leu Lys Gly Lys Asp Val Asp Gln Tyr Pro Arg Leu Gln Glu
115 120 125

Val Ser Thr Glu Asn Pro Leu Ile Leu Lys Thr Lys Leu Leu Lys Ser
130 135 140

Ile Ile Ala Glu Thr Ala Phe Ala Ala Ser Leu Gln Glu Ser Arg Pro
145 150 155 160

Ile Leu Thr Gly Val His Ile Val Leu Ser Asn His Lys Asp Phe Lys
165 170 175

Ala Val Ala Thr Asp Ser His Arg Met Ser Gln Arg Leu Ile Thr Leu
180 185 190

Asp Asn Thr Ser Ala Asp Leu Met Val Val Leu Pro Ser Lys Ser Leu
195 200 205

Arg Glu Phe Ser Ala Val Phe Thr Asp Asp Ile Glu Thr Val Glu Val
210 215 220

Phe Phe Ser Pro Ser Gln Ile Leu Phe Arg Ser Glu His Ile Ser Phe
225 230 235 240

Tyr Thr Arg Leu Leu Glu Gly Asn Tyr Pro Asp Thr Asp Arg Leu Leu
245 250 255

Met Thr Glu Phe Glu Thr Glu Val Val Phe Asn Thr Gln Ser Leu Arg
260 265 270

His Ala Met Glu Arg Ala Phe Leu Ile Ser Asn Ala Thr Gln Asn Gly
275 280 285

Thr Val Lys Leu Glu Ile Thr Gln Asn His Ile Ser Ala His Val Asn
290 295 300

Ser Pro Glu Val Gly Lys Val Asn Glu Asp Leu Asp Ile Val Ser Gln
305 310 315 320

Ser Gly Ser Asp Leu Thr Ile Ser Phe Asn Pro Thr Tyr Leu Ile Glu
325 330 335

Ser Leu Lys Ala Ile Lys Ser Glu Thr Val Lys Ile His Phe Leu Ser
340 345 350

Pro Val Arg Pro Phe Thr Leu Thr Pro Gly Asp Glu Glu Glu Ser Phe
355 360 365

Ile Gln Leu Ile Thr Pro Val Arg Thr Asn
370 375

The present invention also relates to the ssb gene of Streptococcus 
pyogenes encoding the single strand-binding protein (SSB). The ssb gene has a 
nucleotide sequence which corresponds to SEQ. ID. No. 29 as follows: 

atgattaata atgtagtact agttggtcgc atgaccaagg atgcagaact tcgttacaca 60 
ccaagtcaag tagctgtggc taccttcaca cttgctgtta accgtacctt taaaagccaa 120 
aatggtgaac gcgaggcaga tttcattaac tgtgtgatct ggcgtcaacc ggctgaaaat 180 
ttagcgaact gggctaaaaa aggtgctttg atcggagtta cgggtcgtat tcatacacgt 240
aactacgaaa accaacaagg acaacgtgtc tatgtaacag aagttgttgc agataatttc 300
caaatgttgg aaagtcgtgc tacacgtgaa ggtggctcaa ctggctcatt taatggtggt 360
tttaacaata acacttcatc atcaaacagt tactcagcgc ctgcacaaca aacgcctaac 420
tttggaagag atgatagccc atttgggaac tcaaacccga tggatatctc agatgacgat 480 
cttccattct ag 492 



The encoded SSB protein has an amino acid sequence corresponding to SEQ. ID. 
No. 30 as follows: 



Met Ile Asn Asn Val Val Leu Val Gly Arg Met Thr Lys Asp Ala Glu
1 5 10 15

Leu Arg Tyr Thr Pro Ser Gln Val Ala Val Ala Thr Phe Thr Leu Ala
20 25 30

Val Asn Arg Thr Phe Lys Ser Gln Asn Gly Glu Arg Glu Ala Asp Phe
35 40 45

Ile Asn Cys Val Ile Trp Arg Gln Pro Ala Glu Asn Leu Ala Asn Trp
50 55 60

Ala Lys Lys Gly Ala Leu Ile Gly Val Thr Gly Arg Ile Gln Thr Arg
65 70 75 80

Asn Tyr Glu Asn Gln Gln Gly Gln Arg Val Tyr Val Thr Glu Val Val
85 90 95

Ala Asp Asn Phe Gln Met Leu Glu Ser Arg Ala Thr Arg Glu Gly Gly
100 105 110

Ser Thr Gly Ser Phe Asn Gly Gly Phe Asn Asn Asn Thr Ser Ser Ser
115 120 125

Asn Ser Tyr Ser Ala Pro Ala Gln Gln Thr Pro Asn Phe Gly Arg Asp
130 135 140

Asp Ser Pro Phe Gly Asn Ser Asn Pro Met Asp Ile Ser Asp Asp Asp
145 150 155 160

Leu Pro Phe



The present invention also relates to the dnaG gene of Streptococcus 
pyogenes encoding the primase. The dnaG gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 31 as follows: 



atgggatttt tatggggagg tgacgatttg gcaattgaca aagaaatgat ttcccaagta 60 
aaaaatagcg ttaatattgt cgatgtcatt ggagaagtgg tcaaactttc ccgatcaggg 120 
cggcattacc tcgggctttg cccatttcat aaggaaaaga caccctcttt taatgttgtt 180 
gaagacagac aattttttca ctgctttggc tgtggaaaat caggggatgt ttttaaattt 240 
attgaggaat accgccaagt ccccttctta gaaagtgttc agattattgc cgataagact 300 
ggtatgtcgc ttaatatacc gccaagtcag gcagtacttg ctagccaaca caagcaccct 360 
aatcacgctt tgatgacact tcatgaggat gctgctaaat tttaccatgc agttttgatg 420 
accactacca ttggtcaaga agctaggaag tacctttacc agagaggctt ggatgaccaa 480 
ttaattgagc atttcaatat tggtttagcc ccagatgagt cagattatct ttatcaagct 540 
ctttctaaaa aatacgagga aggtcaattg gttgcttcag gattgtttca cttgtccgat 600 
caatccaata ccatttacga cgcctttcga aatcgtatca tgtttccctt atcagatgac 660 
cgagggcata ttattgcctt ttcaggacgt atctggacgg cagctgatat ggaaaagaga 720 
caggcaaagt ataaaaattc aagaggaaca gttcttttta acaaatctta tgaattgtat 780 
catctggaca aggcaaggcc tgttattgcc aaaacccatg aagtgtttct aatggaaggg 840 
tttatggacg tgattgccgc ttaccgttcc ggctatgaaa atgctgttgc ttcaatgggg 900 
acggcattga ctcaagaaca tgtcaatcac cttaagcaag tcactaaaaa agttgttttg 960 
atttatgatg gtgacgatgc tggacaacat gctattgcaa aatcactaga attgcttaaa 1020 
gattttgttg tcgaaattgt cagaatcccc aataaaatgg atcctgacga atttgtacaa 1080 
cggcattccc cagaagcatt tgcagatttg cttaagcagt cacggatcag tagtgttgaa 1140 
ttttttattg attacctaaa acctactaat gtagacaatt tgcaatcaca aattgtttat 1200 
gtggagaaaa tggcaccatt gattgctcaa tcaccatcca tcacagctca acattcgtat 1260 
attaacaaga ttgctgattt gttgccaaac tttgactatt ttcaagtaga acaatcagta 1320 
aatgcattaa ggattcaaga taggcaaaaa catcaaggtc aaatagctca agccgtcagc 1380 
aatcttgtga ccttaccaat gccaaaaagt ttgacagcta ttgctaagac agaaagtcat 1440 
ctcatgcatc ggctcttaca tcatgactac ttattaaatg aatttcgaca tcgtgatgat 1500 
ttttattttg atacctctac cttagaatta ctttatcaac ggctgaagca acaaggacac 1560 
attacatctt atgatttgtc agagatgtca gaggaagtta accgtgctta ttacaatgtt 1620 
ttagaagaaa accttcccaa agaagtagct cttggtgaga ttgatgatat tttatccaaa 1680 
cgtgccaaac ttttagcaga gcgcgatctt cacaaacaag ggaaaaaagt tagagaatct 1740 
agtaacaaag gcgatcatca agcggctcta gaagtactag aacattttat tgcgcagaaa 1800 
cgaaaaatgg aatag 1815 



The encoded primase has an amino acid sequence corresponding to SEQ. ID. No. 32 
as follows: 

Met Gly Phe Leu Trp Gly Gly Asp Asp Leu Ala Ile Asp Lys Glu Met 
1 5 10 15 

Ile Ser Gln Val Lys Asn Ser Val Asn Ile Val Asp Val Ile Gly Glu 
20 25 30 

Val Val Lys Leu Ser Arg Ser Gly Arg His Tyr Leu Gly Leu Cys Pro 
35 40 45 

Phe His Lys Glu Lys Thr Pro Ser Phe Asn Val Val Glu Asp Arg Gln 
50 55 60 

Phe Phe His Cys Phe Gly Cys Gly Lys Ser Gly Asp Val Phe Lys Phe 
65 70 75 80 

Ile Glu Glu Tyr Arg Gln Val Pro Phe Leu Glu Ser Val Gln Ile Ile 
85 90 95 

Ala Asp Lys Thr Gly Met Ser Leu Asn Ile Pro Pro Ser Gln Ala Val 
100 105 110 

Leu Ala Ser Gln His Lys His Pro Asn His Ala Leu Met Thr Leu His 
115 120 125 

Glu Asp Ala Ala Lys Phe Tyr His Ala Val Leu Met Thr Thr Thr Ile 
130 135 140 

Gly Gln Glu Ala Arg Lys Tyr Leu Tyr Gln Arg Gly Leu Asp Asp Gln 
145 150 155 160 

Leu Ile Glu His Phe Asn Ile Gly Leu Ala Pro Asp Glu Ser Asp Tyr 
165 170 175 

Leu Tyr Gln Ala Leu Ser Lys Lys Tyr Glu Glu Gly Gln Leu Val Ala 
180 185 190 

Ser Gly Leu Phe His Leu Ser Asp Gln Ser Asn Thr Ile Tyr Asp Ala 
195 200 205 

Phe Arg Asn Arg Ile Met Phe Pro Leu Ser Asp Asp Arg Gly His Ile 
210 215 220 

Ile Ala Phe Ser Gly Arg Ile Trp Thr Ala Ala Asp Met Glu Lys Arg 
225 230 235 240 

Gln Ala Lys Tyr Lys Asn Ser Arg Gly Thr Val Leu Phe Asn Lys Ser 
245 250 255 

Tyr Glu Leu Tyr His Leu Asp Lys Ala Arg Pro Val Ile Ala Lys Thr 
260 265 270 

His Glu Val Phe Leu Met Glu Gly Phe Met Asp Val Ile Ala Ala Tyr 
275 280 285 

Arg Ser Gly Tyr Glu Asn Ala Val Ala Ser Met Gly Thr Ala Leu Thr 
290 295 300 

Gln Glu His Val Asn His Leu Lys Gln Val Thr Lys Lys Val Val Leu 
305 310 315 320 

Ile Tyr Asp Gly Asp Asp Ala Gly Gln His Ala Ile Ala Lys Ser Leu 
325 330 335 

Glu Leu Leu Lys Asp Phe Val Val Glu Ile Val Arg Ile Pro Asn Lys 
340 345 350 

Met Asp Pro Asp Glu Phe Val Gln Arg His Ser Pro Glu Ala Phe Ala 
355 360 365 

Asp Leu Leu Lys Gln Ser Arg Ile Ser Ser Val Glu Phe Phe Ile Asp 
370 375 380 

Tyr Leu Lys Pro Thr Asn Val Asp Asn Leu Gln Ser Gln Ile Val Tyr 
385 390 395 400 

Val Glu Lys Met Ala Pro Leu Ile Ala Gln Ser Pro Ser Ile Thr Ala 
405 410 415 

Gln His Ser Tyr Ile Asn Lys Ile Ala Asp Leu Leu Pro Asn Phe Asp 
420 425 430 

Tyr Phe Gln Val Glu Gln Ser Val Asn Ala Leu Arg Ile Gln Asp Arg 
435 440 445 

Gln Lys His Gln Gly Gln Ile Ala Gln Ala Val Ser Asn Leu Val Thr 
450 455 460 

Leu Pro Met Pro Lys Ser Leu Thr Ala Ile Ala Lys Thr Glu Ser His 
465 470 475 480 

Leu Met His Arg Leu Leu His His Asp Tyr Leu Leu Asn Glu Phe Arg 
485 490 495 

His Arg Asp Asp Phe Tyr Phe Asp Thr Ser Thr Leu Glu Leu Leu Tyr 
500 505 510 

Gln Arg Leu Lys Gln Gln Gly His Ile Thr Ser Tyr Asp Leu Ser Glu 
515 520 525 

Met Ser Glu Glu Val Asn Arg Ala Tyr Tyr Asn Val Leu Glu Glu Asn 
530 535 540 

Leu Pro Lys Glu Val Ala Leu Gly Glu Ile Asp Asp Ile Leu Ser Lys 
545 550 555 560 

Arg Ala Lys Leu Leu Ala Glu Arg Asp Leu His Lys Gln Gly Lys Lys 
565 570 575 

Val Arg Glu Ser Ser Asn Lys Gly Asp His Gln Ala Ala Leu Glu Val 
580 585 590 

Leu Glu His Phe Ile Ala Gln Lys 
595 600 

The present invention also relates to the dnaB gene of Streptococcus 
pyogenes encoding DnaB. The dnaB gene has a nucleotide sequence which 
corresponds to SEQ. ID. No. 33 as follows: 



atgaggttgc ctgaagtagc tgaattacga gttcaacccc aagatttact agcagagcaa 60 
tctgttcttg ggtcaatctt tatctcacct gataagctga ttgcagtgag agaatttatc 120 
agtccagacg atttttataa gtacgctcat aaaattatct ttcgggcaat gattaccctc 180 
agcgatcgta atgatgccat tgatgcaacc actataagaa caatcctaga tgatcaagat 240 
gatctgcaaa gtattggtgg cttatcctat attgttgaac tagttaatag tgtcccaact 300 
agtgctaatg cagaatatta tgctaaaatt gtagctgaga aagctatgtt gcgtgatatt 360 
attgctaggt tgacagaatc tgtcaaccta gcttatgatg aaattttaaa accagaagag 420 
gttatcgctg gagttgagag agctttaatt gaactcaatg aacatagtaa tcgtagtggg 480 
tttcgcaaaa tttcagatgt gctaaaagtt aattacgagg ctttagaagc acgttctaag 540 
cagacttcaa atgttacagg tttaccaact ggttttagag accttgacaa gattacaaca 600 
ggtttacacc cagatcaatt agttatttta gctgctcggc cagcagtggg gaagactgcc 660 
tttgttctta atattgcgca aaatgtgggg actaagcaaa aaaagactgt tgctattttt 720 
tctttggaaa tgggtgctga aagtttagta gatcgtatgc ttgcagcaga aggaatggtt 780 
gattcgcaca gtttaagaac agggcaactc acagatcagg attggaataa tgtaacaatt 840 
gctcagggag ctttggcaga agcaccgatt tatattgacg atacgcccgg gattaaaatt 900 
actgaaatcc gcgcaagatc acggaaattg tctcaagaag tggatggtgg tttaggtctc 960 
attgtaattg actacttaca gttgattaca ggaactaaac ccgaaaatcg tcagcaagag 1020 
gtttcagata tttcaagaca gcttaaaatc ctagctaaag aattgaaagt accagttatt 1080 
gccctaagtc agctttctcg tggcgttgag caaaggcaag ataaacgacc agttttatca 1140 
gatattcgtg aatcaggatc tattgagcag gatgccgata ttgtagcctt cttataccgg 1200 
gacgattatt accgtaaaga atgtgatgat gctgaagaag ctgttgaaga taacacaatt 1260 
gaagttatcc tcgagaaaaa tagagctggg gcgcgtggaa cagtcaaact gatgttccaa 1320 
aaagaataca acaaattctc aagtatagcc cagtttgaag aaagataa 1368 



The encoded DnaB has an amino acid sequence corresponding to SEQ. ID. No. 34 as 
follows: 



Met Arg Leu Pro Glu Val Ala Glu Leu Arg Val Gln Pro Gln Asp Leu 
1 5 10 15 

Leu Ala Glu Gln Ser Val Leu Gly Ser Ile Phe Ile Ser Pro Asp Lys 
20 25 30 

Leu Ile Ala Val Arg Glu Phe Ile Ser Pro Asp Asp Phe Tyr Lys Tyr 
35 40 45 

Ala His Lys Ile Ile Phe Arg Ala Met Ile Thr Leu Ser Asp Arg Asn 
50 55 60 

Asp Ala Ile Asp Ala Thr Thr Ile Arg Thr Ile Leu Asp Asp Gln Asp 
65 70 75 80 

Asp Leu Gln Ser Ile Gly Gly Leu Ser Tyr Ile Val Glu Leu Val Asn 
85 90 95 

Ser Val Pro Thr Ser Ala Asn Ala Glu Tyr Tyr Ala Lys Ile Val Ala 
100 105 110 

Glu Lys Ala Met Leu Arg Asp Ile Ile Ala Arg Leu Thr Glu Ser Val 
115 120 125 

Asn Leu Ala Tyr Asp Glu Ile Leu Lys Pro Glu Glu Val Ile Ala Gly 
130 135 140 

Val Glu Arg Ala Gln Gly Ala Leu Ala Glu Ala Pro Ile Tyr Ile Asp 
145 150 155 160 

Asp Thr Pro Gly Ile Lys Ile Ala Leu Ile Glu Leu Asn Glu His Ser 
165 170 175 

Asn Arg Ser Gly Phe Arg Lys Ile Ser Asp Val Leu Lys Val Asn Tyr 
180 185 190 

Glu Ala Leu Glu Ala Arg Ser Lys Gln Thr Ser Asn Val Thr Gly Leu 
195 200 205 

Pro Thr Gly Phe Arg Asp Leu Asp Lys Ile Thr Thr Gly Leu His Pro 
210 215 220 

Asp Gln Leu Val Ile Leu Ala Ala Arg Pro Ala Val Gly Lys Thr Ala 
225 230 235 240 

Phe Val Leu Asn Ile Ala Gln Asn Val Gly Thr Lys Gln Lys Lys Thr 
245 250 255 

Val Ala Ile Phe Ser Leu Glu Met Gly Ala Glu Ser Leu Val Asp Arg 
260 265 270 

Met Leu Ala Ala Glu Gly Met Val Asp Ser His Ser Leu Arg Thr Gly 
275 280 285 

Gln Leu Thr Asp Gln Asp Trp Asn Asn Val Thr Ile Thr Glu Ile Arg 
290 295 300 

Ala Arg Ser Arg Lys Leu Ser Gln Glu Val Asp Gly Gly Leu Gly Leu 
305 310 315 320 

Ile Val Ile Asp Tyr Leu Gln Leu Ile Thr Gly Thr Lys Pro Glu Asn 
325 330 335 

Arg Gln Gln Glu Val Ser Asp Ile Ser Arg Gln Leu Lys Ile Leu Ala 
340 345 350 

Lys Glu Leu Lys Val Pro Val Ile Ala Leu Ser Gln Leu Ser Arg Gly 
355 360 365 

Val Glu Gln Arg Gln Asp Lys Arg Pro Val Leu Ser Asp Ile Arg Glu 
370 375 380 

Ser Gly Ser Ile Glu Gln Asp Ala Asp Ile Val Ala Phe Leu Tyr Arg 
385 390 395 400 

Asp Asp Tyr Tyr Arg Lys Glu Cys Asp Asp Ala Glu Glu Ala Val Glu 
405 410 415 

Asp Asn Thr Ile Glu Val Ile Leu Glu Lys Asn Arg Ala Gly Ala Arg 
420 425 430 

Gly Thr Val Lys Leu Met Phe Gln Lys Glu Tyr Asn Lys Phe Ser Ser 
435 440 445 

Ile Ala Gln Phe Glu Glu Arg 
450 455 



Fragments of the above polypeptides or proteins are also encompassed 
by the present invention. 

Suitable fragments can be produced by several means. In the first, 
subclones of the gene encoding the protein of the present invention are produced by 
conventional molecular genetic manipulation by subcloning gene fragments. The 
subclones then are expressed in vitro or in vivo in bacterial cells to yield a smaller 
protein or peptide that can be tested for activity according to the procedures described 
below. 

As an alternative, fragments of replication proteins can be produced by 
digestion of a full-length replication protein with proteolytic enzymes such as 
chymotrypsin, Staphylococcus proteinase A, or trypsin. Different proteolytic 
enzymes are likely to cleave replication proteins at different sites based on the amino 
acid sequence of the protein. Some of the fragments that result from proteolysis may 
be active and can be tested for activity as described below. 
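
The dependence of cleavage sites on amino acid sequence can be illustrated with an in silico digest. The sketch below is only an approximation under simplified, commonly cited specificity rules that are assumptions here and are not stated in this specification: trypsin is taken to cut after Lys or Arg, chymotrypsin after Phe, Trp or Tyr, and neither is taken to cut when the next residue is Pro. The function name `digest` is hypothetical.

```python
def digest(protein: str, cut_after: str, blocked_by: str = "P") -> list[str]:
    """Split a one-letter protein string after each residue in `cut_after`.

    A site is skipped when the following residue is in `blocked_by`
    (simplified rule; real enzymes show further sequence preferences).
    """
    fragments = []
    start = 0
    for i in range(len(protein) - 1):  # never cut after the final residue
        if protein[i] in cut_after and protein[i + 1] not in blocked_by:
            fragments.append(protein[start:i + 1])
            start = i + 1
    fragments.append(protein[start:])
    return fragments


# First 20 residues of the SSB protein (SEQ. ID. No. 30), one-letter code.
ssb_n_terminus = "MINNVVLVGRMTKDAELRYT"

tryptic = digest(ssb_n_terminus, cut_after="KR")
chymotryptic = digest(ssb_n_terminus, cut_after="FWY")
print(tryptic)
print(chymotryptic)
```

The two enzymes give different fragment sets from the same sequence, which is why each digest would need to be tested separately for activity.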

In another approach, based on knowledge of the primary structure of 
the protein, fragments of a replication protein gene may be synthesized by using the 
PCR technique together with specific sets of primers chosen to represent particular 
portions of the protein. These then would be cloned into an appropriate vector for 
increased expression of a truncated peptide or protein. 
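
As a rough illustration of choosing primers that bracket a particular portion of the protein, the sketch below derives a forward and a reverse primer for a codon range of a gene. The primer length of 18 bases and the function names are hypothetical choices for illustration; practical primer design would also consider melting temperature, added restriction sites, and start or stop codons, none of which is handled here.

```python
# Translation table for complementing DNA bases in either case.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def fragment_primers(gene: str, first_codon: int, last_codon: int,
                     primer_len: int = 18) -> tuple[str, str]:
    """Return (forward, reverse) primers spanning codons first..last (1-based).

    primer_len is an illustrative assumption, not a value from the text.
    """
    region = gene[(first_codon - 1) * 3:last_codon * 3]
    if len(region) < primer_len:
        raise ValueError("region is shorter than the requested primer length")
    forward = region[:primer_len]
    reverse = reverse_complement(region[-primer_len:])
    return forward, reverse


# First 60 bases of the ssb gene (SEQ. ID. No. 29), spaces removed.
ssb_start = "atgattaataatgtagtactagttggtcgcatgaccaaggatgcagaacttcgttacaca"
fwd, rev = fragment_primers(ssb_start, first_codon=1, last_codon=10)
print(fwd, rev)
```

The forward primer matches the coding strand at the start of the chosen region, and the reverse primer is the reverse complement of its end, so the amplified product covers codons 1 to 10 only.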

Chemical synthesis can also be used to make suitable fragments. Such 
a synthesis is carried out using known amino acid sequences of replication proteins 
being produced. Alternatively, subjecting a full length replication protein to high 
temperatures and pressures will produce fragments. These fragments can then be 
separated by conventional procedures (e.g., chromatography, SDS-PAGE). 

Variants may also (or alternatively) be modified by, for example, the 
deletion or addition of amino acids that have minimal influence on the properties, 
secondary structure, and hydropathic nature of the polypeptide. For example, a 
polypeptide may be conjugated to a signal (or leader) sequence at the N-terminal end 
of the protein which cotranslationally or post-translationally directs transfer of the 
protein. The polypeptide may also be conjugated to a linker or other sequence for 
ease of synthesis, purification, or identification of the polypeptide. 

Suitable DNA molecules are those that hybridize to a DNA molecule 
comprising a nucleotide sequence of at least about 20, more preferably at least about 
30 to about 50, continuous bases of any of SEQ. ID. Nos. 1, 3, 5, 7, 9, 11, 13, 15, 17, 
19, 21, 23, 25, 27, 29, 31, or 33 under stringent conditions such as those characterized 
by a hybridization buffer comprising 0.9M sodium citrate ("SSC") buffer at a 
temperature of about 37°C and remaining bound when subjected to washing with the SSC 
buffer at a temperature of about 37°C; and preferably in a hybridization buffer 
comprising 20% formamide in 0.9M SSC buffer at a temperature of about 42°C and 
remaining bound when subjected to washing at about 42°C with 0.2x SSC buffer. 
Stringency conditions can be further varied by modifying the temperature and/or salt 
content of the buffer, or by modifying the length of the hybridization probe. 
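
How temperature, salt, formamide, and probe length trade off against one another can be sketched with a widely used empirical approximation for the melting temperature of long DNA-DNA hybrids. This formula is not part of the present specification and is given only as an illustration; the input values below are hypothetical and are not the buffer compositions recited above.

```python
import math


def approx_tm_celsius(na_molar: float, gc_percent: float,
                      formamide_percent: float, probe_length: int) -> float:
    """Empirical estimate of hybrid melting temperature in degrees Celsius.

    Tm = 81.5 + 16.6*log10([Na+]) + 0.41*(%GC) - 0.61*(%formamide) - 500/L
    (an approximation for DNA-DNA hybrids; an assumption of this sketch).
    """
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 0.61 * formamide_percent
            - 500.0 / probe_length)


# Hypothetical probe: 50 bases, 50% GC, 1.0 M monovalent cation.
tm_no_formamide = approx_tm_celsius(1.0, 50.0, 0.0, 50)
tm_with_formamide = approx_tm_celsius(1.0, 50.0, 20.0, 50)
tm_short_probe = approx_tm_celsius(1.0, 50.0, 0.0, 20)
print(tm_no_formamide, tm_with_formamide, tm_short_probe)
```

Under this approximation, adding formamide, lowering the salt concentration, or shortening the probe each lowers the estimated melting temperature, so a fixed hybridization or wash temperature becomes more stringent.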

The proteins or polypeptides of the present invention are preferably 
produced in purified form (preferably at least 80%, more preferably 90%, pure) by 
conventional techniques. Typically, the proteins or polypeptides of the present 
invention are secreted into the growth medium of recombinant host cells. 
Alternatively, the proteins or polypeptides of the present invention are produced but 
not secreted into growth medium. In such cases, to isolate the protein, the host cell 
(e.g., E. coli) carrying a recombinant plasmid is propagated, lysed by sonication, heat, 
or chemical treatment, and the homogenate is centrifuged to remove bacterial debris. 
The supernatant is then subjected to purification procedures such as ammonium 
sulfate precipitation, gel filtration, ion exchange chromatography, FPLC, and HPLC. 

The DNA molecule encoding replication polypeptides or proteins 
derived from Gram positive bacteria can be incorporated in cells using conventional 
recombinant DNA technology. Generally, this involves inserting the DNA molecule 
into an expression system to which the DNA molecule is heterologous (i.e. not 
normally present). The heterologous DNA molecule is inserted into the expression 
system or vector in proper sense orientation and correct reading frame. The vector 
contains the necessary elements for the transcription and translation of the inserted 
protein-coding sequences. 

U.S. Patent No. 4,237,224 to Cohen and Boyer, which is hereby 
incorporated by reference, describes the production of expression systems in the form 
of recombinant plasmids using restriction enzyme cleavage and ligation with DNA 
ligase. These recombinant plasmids are then introduced by means of transformation 
and replicated in unicellular cultures including procaryotic organisms and eucaryotic 
cells grown in tissue culture. 

Recombinant genes may also be introduced into viruses, such as 
vaccinia virus. Recombinant viruses can be generated by transfection of plasmids into 
cells infected with virus. 

Suitable vectors include, but are not limited to, the following viral 
vectors such as lambda vector system gt11, gt WES.tB, Charon 4, and plasmid vectors 
such as pBR322, pBR325, pACYC177, pACYC184, pUC8, pUC9, pUC18, pUC19, 
pLG339, pR290, pKC37, pKC-101, SV 40, pBluescript II SK +/- or KS +/- (see 
"Stratagene Cloning Systems" Catalog (1993) from Stratagene, La Jolla, Calif., which 
is hereby incorporated by reference), pQE, pIH821, pGEX, pET series (see 
F.W. Studier et al., "Use of T7 RNA Polymerase to Direct Expression of Cloned 
Genes," Gene Expression Technology vol. 185 (1990), which is hereby incorporated 
by reference), and any derivatives thereof. Recombinant molecules can be introduced 
into cells via transformation, particularly transduction, conjugation, mobilization, or 
electroporation. The DNA sequences are cloned into the vector using standard 
cloning procedures in the art, as described by Sambrook et al., Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 
(1989), which is hereby incorporated by reference. 

A variety of host-vector systems may be utilized to express the 
protein-encoding sequence(s). Primarily, the vector system must be compatible with the host 
cell used. Host-vector systems include but are not limited to the following: bacteria 
transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA; 
microorganisms such as yeast containing yeast vectors; mammalian cell systems 
infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected 
with virus (e.g., baculovirus); and plant cells infected by bacteria. The expression 
elements of these vectors vary in their strength and specificities. Depending upon the 
host-vector system utilized, any one of a number of suitable transcription and 
translation elements can be used. 

Different genetic signals and processing events control many levels of 
gene expression (e.g., DNA transcription and messenger RNA (mRNA) translation). 

Transcription of DNA is dependent upon the presence of a promoter 
which is a DNA sequence that directs the binding of RNA polymerase and thereby 
promotes mRNA synthesis. The DNA sequences of eucaryotic promoters differ from 
those of procaryotic promoters. Furthermore, eucaryotic promoters and accompanying 
genetic signals may not be recognized in or may not function in a procaryotic system, 
and, further, procaryotic promoters are not recognized and do not function in 
eucaryotic cells. 

Similarly, translation of mRNA in procaryotes depends upon the 
presence of the proper procaryotic signals which differ from those of eukaryotes. 
Efficient translation of mRNA in procaryotes requires a ribosome binding site called 
the Shine-Dalgarno ("SD") sequence on the mRNA. This sequence is a short 
nucleotide sequence of mRNA that is located before the start codon, usually AUG, 
which encodes the amino-terminal methionine of the protein. The SD sequences are 
complementary to the 3'-end of the 16S rRNA (ribosomal RNA) and probably 
promote binding of mRNA to ribosomes by duplexing with the rRNA to allow correct 
positioning of the ribosome. For a review on maximizing gene expression, see 
Roberts and Lauer, Methods in Enzymology , 68:473 (1979), which is hereby 
incorporated by reference. 

Promoters vary in their "strength" (i.e. their ability to promote 
transcription). For the purposes of expressing a cloned gene, it is desirable to use 
strong promoters in order to obtain a high level of transcription and, hence, expression 
of the gene. Depending upon the host cell system utilized, any one of a number of 
suitable promoters may be used. For instance, when cloning in E. coli, its 
bacteriophages, or plasmids, promoters such as the T7 phage promoter, lac promoter, 
trp promoter, recA promoter, ribosomal RNA promoter, the PR and PL promoters of 
coliphage lambda and others, including but not limited to lacUV5, ompF, bla, lpp, 
and the like, may be used to direct high levels of transcription of adjacent DNA 
segments. Additionally, a hybrid trp-lacUV5 (tac) promoter or other E. coli 
promoters produced by recombinant DNA or other synthetic DNA techniques may be 
used to provide for transcription of the inserted gene. 

Bacterial host cell strains and expression vectors may be chosen which 
inhibit the action of the promoter unless specifically induced. In certain operations, 
the addition of specific inducers is necessary for efficient transcription of the inserted 
DNA. For example, the lac operon is induced by the addition of lactose or IPTG 
(isopropylthio-beta-D-galactoside). A variety of other operons, such as trp, pro, etc., 
are under different controls. Additionally, the cell may carry the gene for a 
heterologous RNA polymerase such as from phage T7. Thus, a promoter specific for 
T7 RNA polymerase is used. The T7 RNA polymerase may be under inducible 
control. 

Specific initiation signals are also required for efficient gene 
transcription and translation in procaryotic cells. These transcription and translation 
initiation signals may vary in "strength" as measured by the quantity of gene specific 
messenger RNA and protein synthesized, respectively. The DNA expression vector, 
which contains a promoter, may also contain any combination of various "strong" 
transcription and/or translation initiation signals. For instance, efficient translation in 
E. coli requires an SD sequence about 7-9 bases 5' to the initiation codon ("ATG") to 
provide a ribosome binding site. Thus, an SD-ATG combination that can be utilized 
by host cell ribosomes may be employed. Such combinations include but are not 
limited to the SD-ATG combination from the cro gene or the N gene of coliphage 
lambda, or from the E. coli tryptophan E, D, C, B or A genes. Additionally, any SD- 
ATG combination produced by recombinant DNA or other techniques involving 
incorporation of synthetic nucleotides may be used. 
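
The SD-ATG spacing requirement can be checked on a candidate construct with a simple pattern search. The sketch below makes two assumptions that are not taken from this specification: it uses the consensus motif AGGAGG to stand in for an SD sequence, and it measures the 7-9 base spacing as the number of bases between the end of that motif and the ATG. The function name `find_sd_atg` is hypothetical.

```python
import re

# Assumed SD consensus (AGGAGG), a 7-9 base spacer, then the initiation codon.
SD_ATG_PATTERN = re.compile(r"AGGAGG([ACGT]{7,9})ATG")


def find_sd_atg(dna: str):
    """Return (sd_start, spacer_length, atg_start) for the first match, else None.

    Positions are 0-based indices into the upper-cased input sequence.
    """
    match = SD_ATG_PATTERN.search(dna.upper())
    if match is None:
        return None
    spacer = match.group(1)
    return match.start(), len(spacer), match.end() - 3


# Hypothetical construct: SD motif, 8-base spacer, ATG, then one codon.
good = "TT" + "AGGAGG" + "CCCCCCCC" + "ATG" + "GCT"
# Hypothetical construct with the ATG only 3 bases after the SD motif.
too_close = "TT" + "AGGAGG" + "CCC" + "ATG" + "GCT"

print(find_sd_atg(good))
print(find_sd_atg(too_close))
```

A construct that returns no match under these assumptions would be a candidate for re-engineering the region upstream of the initiation codon, for example by substituting one of the SD-ATG combinations named above.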

Once the isolated DNA molecule encoding a replication polypeptide or 
protein has been cloned into an expression system, it is ready to be incorporated into a 
host cell. Such incorporation can be carried out by the various forms of 
transformation noted above, depending upon the vector/host cell system. Suitable host 
cells include, but are not limited to, bacteria, viruses, yeast, mammalian cells, insects, 
plants, and the like. 

The invention provides efficient methods of identifying 
pharmacological agents or lead compounds for agents active at the level of a 
replication protein function, particularly DNA replication. Generally, these screening 
methods involve assaying for compounds which interfere with the replication activity. 
The methods are amenable to automated, cost-effective high throughput screening of 
chemical libraries for lead compounds. Identified reagents find use in the 
pharmaceutical industries for animal and human trials; for example, the reagents may 
be derivatized and rescreened in in vitro and in vivo assays to optimize activity and 
minimize toxicity for pharmaceutical development. Target therapeutic indications are 
limited only in that the target cellular function be subject to modulation, usually 
inhibition, by disruption of a replication activity or the formation of a complex 
comprising a replication protein and one or more natural intracellular binding targets. 
Target indications may include arresting cell growth or causing cell death resulting in 
recovery from the bacterial infection in animal studies. 

A wide variety of assays for activity and binding agents are provided, 
including DNA synthesis, ATPase, clamp loading onto DNA, protein-protein binding 
assays, immunoassays, cell based assays, etc. The replication protein compositions, 
used to identify pharmacological agents, are in isolated, partially pure or pure form 
and are typically recombinantly produced. The replication protein may be part of a 
fusion product with another peptide or polypeptide (e.g., a polypeptide that is capable 
of providing or enhancing protein-protein binding, stability under assay conditions 
(e.g., a tag for detection or anchoring), etc.). The assay mixtures comprise a natural 
intracellular replication protein binding target such as DNA, another protein, NTP, or 
dNTP. For binding assays, while native binding targets may be used, it is frequently 
preferred to use portions (e.g., peptides, nucleic acid fragments) thereof so long as the 
portion provides binding affinity and avidity to the subject replication protein 
conveniently measurable in the assay. The assay mixture also comprises a candidate 
pharmacological agent. Generally, a plurality of assay mixtures are run in parallel 
with different agent concentrations to obtain a differential response to the various 
concentrations. Typically, one of these concentrations serves as a negative control 
(i.e., at zero concentration or below the limits of assay detection). Additional controls 
are often present such as a positive control, a dose response curve, use of known 
inhibitors, use of control heterologous proteins, etc. Candidate agents encompass 
numerous chemical classes, though typically they are organic compounds; preferably 
they are small organic compounds and are obtained from a wide variety of sources, 
including libraries of synthetic or natural compounds. A variety of other reagents may 
also be included in the mixture. These include reagents like salts, buffers, neutral 
proteins (e.g., albumin), detergents, etc., which may be used to facilitate optimal 
binding and/or reduce nonspecific or background interactions, etc. Also, reagents that 
otherwise improve the efficiency of the assay (e.g., protease inhibitors, nuclease 
inhibitors, antimicrobial agents, etc.) may be used. 
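
The comparison of parallel assay mixtures against the zero-concentration negative control can be sketched as a percent-inhibition calculation. The readings, units, and function names below are hypothetical and serve only to show the arithmetic; they are not data from this specification.

```python
def percent_inhibition(signal: float, negative_control: float,
                       background: float = 0.0) -> float:
    """Percent loss of assay signal relative to the no-agent control."""
    span = negative_control - background
    if span <= 0:
        raise ValueError("negative control must exceed background")
    return 100.0 * (1.0 - (signal - background) / span)


def dose_response(readings: dict, background: float = 0.0) -> dict:
    """Map each agent concentration to percent inhibition.

    `readings` maps concentration -> signal and must include the
    zero-concentration mixture, which serves as the negative control.
    """
    control = readings[0.0]
    return {
        conc: percent_inhibition(signal, control, background)
        for conc, signal in sorted(readings.items())
    }


# Hypothetical DNA synthesis signals (arbitrary units) at four concentrations.
example = {0.0: 1000.0, 1.0: 750.0, 10.0: 500.0, 100.0: 100.0}
curve = dose_response(example)
for conc, inhibition in curve.items():
    print(conc, round(inhibition, 1))
```

A candidate agent that gives increasing inhibition with increasing concentration, while a control heterologous protein assay is unaffected, would be a more convincing hit than one showing a single inhibited point.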

The invention provides replication protein specific assays and the 
binding agents including natural intracellular binding targets such as other replication 
proteins, etc., and methods of identifying and making such agents, and their use in a 
variety of diagnostic and therapeutic applications, especially where disease is 
associated with excessive cell growth. Novel replication protein-specific binding 
agents include replication protein-specific antibodies and other natural intracellular 
binding agents identified with assays such as one- and two-hybrid screens, non-natural 
intracellular binding agents identified in screens of chemical libraries, etc. 

Generally, replication protein-specificity of the binding agent is shown 
by binding equilibrium constants. Such agents are capable of selectively binding a 
replication protein (i.e., with an equilibrium constant at least about 10⁷ M⁻¹, 
preferably, at least about 10⁸ M⁻¹, more preferably, at least about 10⁹ M⁻¹). A wide 
variety of cell-based and cell-free assays may be used to demonstrate replication 
protein-specific activity, binding, gel shift assays, immunoassays, etc. 
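
To relate the equilibrium (association) constants above to practical concentrations, the sketch below converts an association constant to a dissociation constant and estimates the fraction of replication protein bound. It assumes simple 1:1 binding with the agent in excess, which is a simplification not stated in this specification; the function names are hypothetical.

```python
def dissociation_constant(ka_per_molar: float) -> float:
    """Kd (M) is the reciprocal of the association constant Ka (1/M)."""
    return 1.0 / ka_per_molar


def fraction_bound(ka_per_molar: float, free_agent_molar: float) -> float:
    """Fraction of protein bound for 1:1 binding: Ka[L] / (1 + Ka[L])."""
    x = ka_per_molar * free_agent_molar
    return x / (1.0 + x)


# The three thresholds recited above, evaluated at 100 nM free agent.
for ka in (1e7, 1e8, 1e9):
    print(ka, dissociation_constant(ka), fraction_bound(ka, 1e-7))
```

Under these assumptions, an agent at the lowest recited threshold occupies about half of the protein at 100 nM, while one at the highest threshold occupies nearly all of it at the same concentration.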

The resultant mixture is incubated under conditions whereby, but for 
the presence of the candidate pharmacological agent, the replication protein 
specifically binds the cellular binding target, portion, or analog. The mixture of 
components can be added in any order that provides for the requisite bindings. 
Incubations may be performed at any temperature which facilitates optimal binding, 
typically between 4°C and 40°C, more commonly between 15°C and 40°C. Incubation 
periods are likewise selected for optimal binding but also minimized to facilitate 
rapid, high-throughput screening, and are typically between 0.1 and 10 hours, 
preferably less than 5 hours, more preferably less than 2 hours. 

After incubation, the presence or absence of activity or specific binding 
between the replication protein and one or more binding targets is detected by any 
convenient way. For cell-free activity and binding type assays, a separation step may 
be used to separate the activity product or the bound from unbound components. 
Separation may be effected by precipitation (e.g., immunoprecipitation), 
immobilization (e.g., on a solid substrate such as a microtiter plate), etc., followed by 
washing. Many assays that do not require separation are also possible such as use of 
europium conjugation in proximity assays or a detection system that is dependent on a 
product or loss of substrate. 

Detection may be effected in any convenient way. For cell-free activity 
and binding assays, one of the components usually comprises or is coupled to a label. 
A wide variety of labels may be employed - essentially any label that provides for 
detection of DNA product, loss of DNA substrate, conversion of a nucleotide 
substrate, or bound protein is useful. The label may provide for direct detection such 
as radioactivity, fluorescence, luminescence, optical, or electron density, etc. or 
indirect detection such as an epitope tag, an enzyme, etc. The label may be appended 
to the protein (e.g., a phosphate group comprising a radioactive isotope of 
phosphorus), or incorporated into the DNA substrate or the protein structure (e.g., a 
methionine residue comprising a radioactive isotope of sulfur). A variety of methods 
may be used to detect the label depending on the nature of the label and other assay 
components. For example, the label may be detected bound to the solid substrate, or a 
portion of the bound complex containing the label may be separated from the solid 
substrate, and thereafter the label detected. Labels may be directly detected through 
optical or electron density, radioactive emissions, nonradiative energy transfer, 
fluorescence emission, etc. or indirectly detected with antibody conjugates, etc. For 
example, in the case of radioactive labels, emissions may be detected directly (e.g., 
with particle counters) or indirectly (e.g., with scintillation cocktails and counters). 

The present invention identifies the set of proteins that together result 
in a three component polymerase from bacteria that are distantly related to E. coli, 
such as Gram positive bacteria. Specifically, these bacteria lack several genes that E. 
coli DNA polymerase III has, such as holC, holD or holE. Further, dnaX is believed 
to encode only one protein, tau. Also, holA is quite divergent in homology suggesting 
it may function in another process in these organisms. Gram positive cells even have 
replication genes that E. coli does not, implying that they may not utilize the 
replication strategies exemplified by E. coli. 

The present invention identifies genes and proteins that form a three 
component polymerase in Gram positive organisms, such as S. pyogenes and S. 
aureus. In S. pyogenes and S. aureus, the polymerase, α-large, functions with a 
β clamp and a clamp loader component of τδδ'. They display high speed and 
processivity in synthesis of ssDNA coated with SSB and primed with a DNA 
oligonucleotide. 

This invention also expresses and purifies a protein from a Gram 
positive bacterium that is homologous to the E. coli beta subunit. The invention 
demonstrates that it behaves like a circular protein. Further, this invention shows that 
a beta subunit from a Gram positive bacterium is functional with both Pol III-L (α-large) 
from a Gram positive bacterium and with DNA polymerase III from a Gram negative 
bacterium. This result can be explained by an interaction between the clamp and the 
polymerase that has been conserved during the evolutionary divergence of Gram 
positive and Gram negative cells. A chemical inhibitor that would disrupt this 
interaction would be predicted to have a broad spectrum of antibiotic activity, shutting 
down replication in Gram negative and Gram positive cells alike. This assay, and 
others based on this interaction, can be devised to screen chemicals for such 
inhibition. Further, since all the proteins in this assay are highly overexpressed 
through recombinant techniques, sufficient quantities of the protein reagents can be 
obtained for screening hundreds of thousands of compounds. 

This invention also shows that the DnaE polymerase (α-small),
encoded by the dnaE gene, functions with the beta clamp and τδδ' complex. The
speed of DnaE is not significantly increased by τδδ' and β, but the processivity of
DnaE is greatly increased by τδδ' and β. Hence, the DnaE polymerase, coupled with
its β clamp on DNA (loaded by τδδ'), may also be an important target for a candidate
pharmaceutical drug.

The present invention provides methods by which replication proteins
from a Gram positive bacterium are used to discover new pharmaceutical agents. The
function of replication proteins is quantified in the presence of different chemical
compounds. A chemical compound that inhibits the function is a candidate antibiotic.
Some replication proteins from a Gram positive bacterium and from a Gram negative
bacterium can be interchanged for one another. Hence, they can function as mixtures.
Reactions that assay for the function of enzyme mixtures consisting of proteins from
Gram positive bacteria and from Gram negative bacteria can also be used to discover
drugs. Suitable E. coli replication proteins are the subunits of its Pol III holoenzyme
which are described in U.S. Patent Nos. 5,583,026 and 5,668,004 to O'Donnell, which
are hereby incorporated by reference.

The methods described herein to obtain genes, and the assays
demonstrating activity behavior of S. pyogenes and S. aureus replication proteins are
likely to generalize to all members of the Streptococcus and Staphylococcus genera,
as well as to all Gram positive bacteria. Such assays are also likely to generalize to
other cells besides Gram positive bacteria which also share features in common with
S. pyogenes and S. aureus that are different from E. coli (i.e., lacking holC, holD, or
holE; having a dnaX gene encoding a single protein; or having a weak homology to
holA encoding delta).

The present invention describes a method of identifying compounds
which inhibit the activity of a polymerase product of polC or dnaE. This method is
carried out by forming a reaction mixture that includes a primed DNA molecule, a
polymerase product of polC or dnaE, a candidate compound, a dNTP, and optionally
either a beta subunit, a tau complex, or both the beta subunit and the tau complex,
wherein at least one of the polymerase product of polC or dnaE, the beta subunit, the
tau complex, or a subunit or combination of subunits thereof is derived from a
Eubacteria other than Escherichia coli; subjecting the reaction mixture to conditions
effective to achieve nucleic acid polymerization in the absence of the candidate
compound; analyzing the reaction mixture for the presence or absence of nucleic acid
polymerization extension products; and identifying the candidate compound in the
reaction mixture where there is an absence of nucleic acid polymerization extension
products. Preferably, the polymerase product of polC or dnaE, the beta subunit, the
tau complex, or the subunit or combination of subunits thereof is derived from a Gram
positive bacterium, more preferably a Streptococcus bacterium such as S. pyogenes or
a Staphylococcus bacterium such as S. aureus.

The present invention describes a method to identify chemicals that 
inhibit the activity of the three component polymerase. This method involves 
contacting primed DNA with the DNA polymerase in the presence of the candidate 
pharmaceutical, and dNTPs (or modified dNTPs) to form a reaction mixture. The
reaction mixture is subjected to conditions effective to achieve nucleic acid
polymerization in the absence of the candidate pharmaceutical and the presence or
absence of the extension product in the reaction mixture is analyzed. The candidate 
pharmaceutical is detected by the absence of product. 
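
The readout used in these screening methods, extension product present in the uninhibited control and absent when an inhibitor is present, can be scored numerically. The following is a minimal Python sketch offered only as an illustration, not as part of the disclosed methods. The function names, the 50% threshold, and the example readings are hypothetical assumptions.

```python
def percent_inhibition(signal, uninhibited, background):
    """Percent loss of extension-product signal relative to the no-compound control."""
    window = uninhibited - background
    if window <= 0:
        raise ValueError("uninhibited control must exceed background")
    return 100.0 * (1.0 - (signal - background) / window)


def candidate_inhibitors(signals, uninhibited, background, threshold=50.0):
    """Return names of compounds whose inhibition meets the (assumed) threshold."""
    return sorted(
        name
        for name, signal in signals.items()
        if percent_inhibition(signal, uninhibited, background) >= threshold
    )


if __name__ == "__main__":
    # Hypothetical readings (e.g., pmol nucleotide incorporated); not data from the text.
    readings = {"compound_A": 95.0, "compound_B": 12.0, "compound_C": 48.0}
    print(candidate_inhibitors(readings, uninhibited=100.0, background=2.0))
```

Subtracting a background reading before normalizing keeps a compound from being scored as an inhibitor merely because of a high assay blank; the threshold would in practice be chosen from the variability of the controls.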

The present invention describes a method to identify candidate
pharmaceuticals that inhibit the activity of a clamp loader complex and a beta subunit
in stimulating the DNA polymerase. The method includes contacting a primed DNA
(which may be coated with SSB) with a DNA polymerase, a beta subunit, and a tau
complex (or subunit or subassembly of the tau complex) in the presence of the
candidate pharmaceutical, and dNTPs (or modified dNTPs) to form a reaction
mixture. The reaction mixture is subjected to conditions which, in the absence of the
candidate pharmaceutical, would effect nucleic acid polymerization and the presence
or absence of the extension product in the reaction mixture is analyzed. The candidate
pharmaceutical is detected by the absence of product. The DNA polymerase, the beta
subunit, and/or the tau complex or subunit(s) thereof are derived from a Gram positive
bacterium.

The present invention describes a method to identify chemicals that 
inhibit the ability of a beta subunit and a DNA polymerase to interact physically. This 
method involves contacting the beta subunit with the DNA polymerase in the presence
of the candidate pharmaceutical to form a reaction mixture. The reaction mixture is
subjected to conditions under which the DNA polymerase and the beta subunit would
interact in the absence of the candidate pharmaceutical. The reaction mixture is then
analyzed for interaction between the beta subunit and the DNA polymerase. The
candidate pharmaceutical is detected by the absence of interaction between the beta 
subunit and the DNA polymerase. The DNA polymerase and/or the beta subunit are 
derived from a Gram positive bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the ability of a beta subunit and a tau complex (or a subunit or subassembly of 
the tau complex) to interact. This method includes contacting the beta subunit with 
the tau complex (or subunit or subassembly of the tau complex) in the presence of the 
candidate pharmaceutical to form a reaction mixture. The reaction mixture is 
subjected to conditions under which the tau complex (or the subunit or subassembly 
of the tau complex) and the beta subunit would interact in the absence of the candidate 
pharmaceutical. The reaction mixture is then analyzed for interaction between the 
beta subunit and the tau complex (or the subunit or subassembly of the tau complex). 
The candidate pharmaceutical is detected by the absence of interaction between the 
beta subunit and the tau complex (or the subunit or subassembly of the tau complex).
The beta subunit and/or the tau complex or subunit thereof is derived from a Gram 
positive bacterium. 

The present invention describes a method to identify chemicals that 
inhibit the ability of a tau complex (or a subassembly of the tau complex) to assemble 
a beta subunit onto a DNA molecule. This method involves contacting a circular 
primed DNA molecule (which may be coated with SSB) with the tau complex (or the 
subassembly thereof) and the beta subunit in the presence of the candidate 
pharmaceutical, and ATP or dATP to form a reaction mixture. The reaction mixture 
is subjected to conditions under which the tau complex (or subassembly) assembles 
the beta subunit on the DNA molecule absent the candidate pharmaceutical. The 
presence or absence of the beta subunit on the DNA molecule in the reaction mixture 
is analyzed. The candidate pharmaceutical is detected by the absence of the beta 
subunit on the DNA molecule. The beta subunit and/or the tau complex are derived
from a Gram positive bacterium.

The present invention describes a method to identify chemicals that
inhibit the ability of a tau complex (or a subunit(s) of the tau complex) to disassemble
a beta subunit from a DNA molecule. This method comprises contacting a DNA
molecule onto which the beta subunit has been assembled in the presence of the
candidate pharmaceutical, to form a reaction mixture. The reaction mixture is
subjected to conditions under which the tau complex (or a subunit(s) or subassembly
of the tau complex) disassembles the beta subunit from the DNA molecule absent the
candidate pharmaceutical. The presence or absence of the beta subunit on the DNA
molecule in the reaction mixture is analyzed. The candidate pharmaceutical is
detected by the presence of the beta subunit on the DNA molecule. The beta subunit
and/or the tau complex are derived from a Gram positive bacterium.

The present invention describes a method to identify chemicals that 
disassemble a beta subunit from a DNA molecule. This method involves contacting a 
circular primed DNA molecule (which may be coated with SSB) upon which the beta
subunit has been assembled (e.g., by action of the tau complex) with the candidate
pharmaceutical. The presence or absence of the beta subunit on the DNA molecule in
the reaction mixture is analyzed. The candidate pharmaceutical is detected by the 
absence of the beta subunit on the DNA molecule. The beta subunit is derived from a 
Gram positive bacterium. 

The present invention describes a method to identify chemicals that
inhibit the dATP/ATP binding activity of a tau complex or a tau complex subunit (e.g.,
tau subunit). This method includes contacting the tau complex (or the tau complex
subunit) with dATP/ATP either in the presence or absence of a DNA molecule and/or
the beta subunit in the presence of the candidate pharmaceutical to form a reaction
mixture. The reaction mixture is subjected to conditions in which the tau complex (or the
subunit of tau complex) interacts with dATP/ATP in the absence of the candidate
pharmaceutical. The reaction is analyzed to determine if dATP/ATP is bound to the
tau complex (or the subunit of tau complex) in the presence of the candidate
pharmaceutical. The candidate pharmaceutical is detected by the absence of
hydrolysis. The tau complex and/or the beta subunit is derived from a Gram positive
bacterium.

The present invention describes a method to identify chemicals that 
inhibit the dATP/ATPase activity of a tau complex or a tau complex subunit (e.g., the
tau subunit). This method involves contacting the tau complex (or the tau complex
subunit) with dATP/ATP either in the presence or absence of a DNA molecule and/or
a beta subunit in the presence of the candidate pharmaceutical to form a reaction
mixture. The reaction mixture is subjected to conditions in which the tau subunit (or
complex) hydrolyzes dATP/ATP in the absence of the candidate pharmaceutical. The
reaction is analyzed to determine if dATP/ATP was hydrolyzed. Suitable candidate
pharmaceuticals are identified by the absence of hydrolysis. The tau complex and/or
the beta subunit is derived from a Gram positive bacterium.

Further methods for identifying chemicals that inhibit the activity of a
DNA polymerase encoded by either the dnaE gene, polC gene, or their accessory
proteins (i.e., clamp loader, clamp, etc.), are as follows:

1) Contacting a primed DNA molecule with the encoded product
of the dnaE gene or polC gene in the presence of the candidate pharmaceutical, and
dNTPs (or modified dNTPs) to form a reaction mixture. The reaction mixture is
subjected to conditions which, in the absence of the candidate pharmaceutical, effect
nucleic acid polymerization, and the presence or absence of the extension product in
the reaction mixture is analyzed. The candidate pharmaceutical is detected by the
absence of extension product. The protein encoded by the dnaE gene or polC gene
is derived from a Gram positive bacterium.

2) Contacting a linear primed DNA molecule with a beta subunit
and the encoded product of dnaE or polC in the presence of the candidate
pharmaceutical, and dNTPs (or modified dNTPs) to form a reaction mixture. The
reaction mixture is subjected to conditions which, in the absence of the candidate
pharmaceutical, effect nucleic acid polymerization, and the presence or absence of the
extension product in the reaction mixture is analyzed. The candidate pharmaceutical
is detected by the absence of extension product. The protein encoded by the dnaE
gene or polC gene is derived from a Gram positive bacterium.

3) Contacting a circular primed DNA molecule (which may be coated
with SSB) with a tau complex, a beta subunit and the encoded product of a dnaE gene
or polC gene in the presence of the candidate pharmaceutical, and dNTPs (or
modified dNTPs) to form a reaction mixture. The reaction mixture is subjected to
conditions which, in the absence of the candidate pharmaceutical, effect nucleic acid
polymerization, and the presence or absence of the extension product in the reaction
mixture is analyzed. The candidate pharmaceutical is detected by the absence of
product. The protein encoded by the dnaE gene or polC gene, the beta subunit,
and/or the tau complex are derived from a Gram positive bacterium.

4) Contacting a beta subunit with the product encoded by a dnaE
gene or polC gene in the presence of the candidate pharmaceutical to form a reaction
mixture. The reaction mixture is then analyzed for interaction between the beta
subunit and the product encoded by the dnaE gene or polC gene. The candidate
pharmaceutical is detected by the absence of interaction between the beta subunit and
the product encoded by the dnaE gene or polC gene. The beta subunit and/or the
protein encoded by the dnaE gene or polC gene is derived from a Gram positive
bacterium.

5) The present invention discloses a method to identify chemicals
that inhibit a DnaB helicase. The method includes contacting the DnaB helicase with
a DNA molecule substrate that has a duplex region in the presence of a nucleoside or
deoxynucleoside triphosphate energy source and a candidate pharmaceutical to form a
reaction mixture. The reaction mixture is subjected to conditions that support helicase
activity in the absence of the candidate pharmaceutical. The DNA duplex molecule in
the reaction mixture is analyzed for whether it is converted to ssDNA. The candidate
pharmaceutical is detected by the absence of conversion of the duplex DNA molecule
to the ssDNA molecule. The DnaB helicase is derived from a Gram positive
bacterium.

6) The present invention describes a method to identify chemicals
that inhibit the nucleoside or deoxynucleoside triphosphatase activity of a DnaB
helicase. The method includes contacting the DnaB helicase with a DNA molecule
substrate that has a duplex region in the presence of a nucleoside or deoxynucleoside
triphosphate energy source and the candidate pharmaceutical to form a reaction
mixture. The reaction mixture is subjected to conditions that support nucleoside or
deoxynucleoside triphosphatase activity of the DnaB helicase in the absence of the
candidate pharmaceutical. The candidate pharmaceutical is detected by the absence of
conversion of nucleoside or deoxynucleoside triphosphate to nucleoside or
deoxynucleoside diphosphate. The DnaB helicase is derived from a Gram positive
bacterium.

7) The present invention describes a method to identify chemicals
that inhibit a primase. The method includes contacting primase with a ssDNA
molecule in the presence of a candidate pharmaceutical to form a reaction mixture.
The reaction mixture is subjected to conditions that support primase activity (e.g., the
presence of nucleoside or deoxynucleoside triphosphates, appropriate buffer, presence
or absence of DnaB helicase) in the absence of the candidate pharmaceutical. Suitable
candidate pharmaceuticals are identified by the absence of primer formation detected
either directly or indirectly. The primase is derived from a Gram positive bacterium.

8) The present invention describes a method to identify chemicals
that inhibit the ability of a primase and the protein encoded by a dnaB gene to interact.
This method includes contacting the primase with the protein encoded by the dnaB
gene in the presence of the candidate pharmaceutical to form a reaction mixture. The
reaction mixture is subjected to conditions under which the primase and the protein
encoded by the dnaB gene interact in the absence of the candidate pharmaceutical.
The reaction mixture is then analyzed for interaction between the primase and the
protein encoded by the dnaB gene. The candidate pharmaceutical is detected by the
absence of interaction between the primase and the protein encoded by the dnaB gene.
The primase and/or the dnaB gene are derived from a Gram positive bacterium.

9) The present invention describes a method to identify chemicals
that inhibit the ability of a protein encoded by a dnaB gene to interact with a DNA
molecule. This method includes contacting the protein encoded by the dnaB gene
with the DNA molecule in the presence of the candidate pharmaceutical to form a
reaction mixture. The reaction mixture is subjected to conditions under which the
DNA molecule and the protein encoded by the dnaB gene interact in the absence of
the candidate pharmaceutical. The reaction mixture is then analyzed for interaction
between the protein encoded by the dnaB gene and the DNA molecule. The candidate
pharmaceutical is detected by the absence of interaction between the DNA molecule
and the protein encoded by the dnaB gene. The dnaB gene is derived from a Gram
positive bacterium.

EXAMPLES

The following examples are provided to illustrate embodiments of the
present invention, but they are by no means intended to limit its scope.

Example 1 - Materials 

Labeled deoxy- and ribonucleoside triphosphates were from Dupont-New
England Nuclear; unlabelled deoxy- and ribonucleoside triphosphates were from
Pharmacia-LKB. E. coli replication proteins were purified as described: alpha,
epsilon, gamma, and tau (Studwell et al., "Processive Replication is Contingent on the
Exonuclease Subunit of DNA Polymerase III Holoenzyme," J. Biol. Chem., 265:1171-1178
(1990), which is hereby incorporated by reference), beta (Kong et al., "Three
Dimensional Structure of the Beta Subunit of Escherichia coli DNA Polymerase III
Holoenzyme: A Sliding DNA Clamp," Cell, 69:425-437 (1992), which is hereby
incorporated by reference), delta and delta prime (Dong et al., "DNA Polymerase III
Accessory Proteins. I. HolA and holB Encoding δ and δ'," J. Biol. Chem., 268:11758-11765
(1993), which is hereby incorporated by reference), chi and psi (Xiao et al.,
"DNA Polymerase III Accessory Proteins. III. HolC and holD Encoding chi and psi,"
J. Biol. Chem., 268:11773-11778 (1993), which is hereby incorporated by reference),
theta (Studwell-Vaughan et al., "DNA Polymerase III Accessory Proteins. V. Theta
Encoded by holE," J. Biol. Chem., 268:11785-11791 (1993), which is hereby
incorporated by reference), and SSB (Weiner et al., "The Deoxyribonucleic Acid
Unwinding Protein of Escherichia coli," J. Biol. Chem., 250:1972-1980 (1975),
which is hereby incorporated by reference). E. coli Pol III core and clamp loader
complex (composed of subunits gamma, delta, delta prime, chi, and psi) were
reconstituted as described in Onrust et al., "Assembly of a Chromosomal Replication
Machine: Two DNA Polymerases, a Clamp Loader and Sliding Clamps in One
Holoenzyme Particle. I. Organization of the Clamp Loader," J. Biol. Chem.,
270:13348-13357 (1995), which is hereby incorporated by reference. Pol III* was
reconstituted and purified as described in Onrust et al., "Assembly of a Chromosomal
Replication Machine: Two DNA Polymerases, a Clamp Loader and Sliding Clamps
in One Holoenzyme Particle. III. Interface Between Two Polymerases and the Clamp
Loader," J. Biol. Chem., 270:13366-13377 (1995), which is hereby incorporated by
reference. Protein concentrations were quantitated by the Protein Assay (Bio-Rad)
method using bovine serum albumin (BSA) as a standard. DNA oligonucleotides
were synthesized by Oligos etc. Calf thymus DNA was from Sigma. Buffer A is 20
mM Tris-HCl (pH 7.5), 0.5 mM EDTA, 2 mM DTT, and 20% glycerol. Replication
buffer is 20 mM Tris-Cl (pH 7.5), 8 mM MgCl2, 5 mM DTT, 0.5 mM EDTA, 40
μg/ml BSA, 4% glycerol, 0.5 mM ATP, 3 mM each dCTP, dGTP, dATP, and 20 μM
[α-32P]dTTP. P-cell buffer is 50 mM potassium phosphate (pH 7.6), 5 mM DTT, 0.3
mM EDTA, 20% glycerol. T.E. buffer is 10 mM Tris-HCl (pH 7.5), 1 mM EDTA.
Cell lysis buffer is 50 mM Tris-HCl (pH 8.0), 10% sucrose, 1 M NaCl, 0.3 mM
spermidine.
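
As an illustration of how a buffer such as the replication buffer might be assembled from concentrated stocks, the following minimal Python sketch applies the dilution relation C(stock) x V(stock) = C(final) x V(final). The final concentrations are those stated above for the millimolar components; the stock concentrations and function names are hypothetical assumptions, not values given in this description.

```python
def stock_volume_ul(final_conc_mM, stock_conc_mM, final_volume_ul):
    """Volume of stock (ul) such that C_stock * V_stock = C_final * V_final."""
    if stock_conc_mM <= final_conc_mM:
        raise ValueError("stock must be more concentrated than the final solution")
    return final_conc_mM * final_volume_ul / stock_conc_mM


# Final concentrations (mM) as stated for replication buffer.
REPLICATION_BUFFER_mM = {
    "Tris-Cl": 20.0,
    "MgCl2": 8.0,
    "DTT": 5.0,
    "EDTA": 0.5,
    "ATP": 0.5,
}

# Hypothetical stock concentrations (mM); assumed for illustration only.
ASSUMED_STOCKS_mM = {
    "Tris-Cl": 1000.0,
    "MgCl2": 1000.0,
    "DTT": 1000.0,
    "EDTA": 500.0,
    "ATP": 100.0,
}


def recipe(final_volume_ul):
    """Stock volumes (ul) for each millimolar component at the given final volume."""
    return {
        name: stock_volume_ul(conc, ASSUMED_STOCKS_mM[name], final_volume_ul)
        for name, conc in REPLICATION_BUFFER_mM.items()
    }


if __name__ == "__main__":
    for name, volume in recipe(1000.0).items():
        print(f"{name}: {volume:.2f} ul per 1000 ul")
```

Components specified by mass or percentage (BSA, glycerol) and the labeled nucleotide are left out of the sketch because they need different unit handling.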

Example 2 - Calf Thymus DNA Replication Assays 

These assays were used in the purification of DNA polymerases from
S. aureus cell extracts. Assays contained 2.5 μg activated calf thymus DNA in a final
volume of 25 μl replication buffer. An aliquot of the fraction to be assayed was added
to the assay mixture on ice followed by incubation at 37°C for 5 min. DNA synthesis
was quantitated using DE81 paper as described in Rowen et al., "Primase, the DnaG
Protein of Escherichia coli. An Enzyme Which Starts DNA Chains," J. Biol. Chem.,
253:758-764 (1979), which is hereby incorporated by reference.

Example 3 - PolydA-oligodT Replication Assays 

PolydA-oligodT was prepared as follows. PolydA of average length
4500 nucleotides was purchased from SuperTecs. OligodT35 was synthesized by
Oligos etc. 145 μl of 5.2 mM (as nucleotide) polydA and 22 μl of 1.75 mM (as
nucleotide) oligodT were mixed in a final volume of 2100 μl T.E. buffer (ratio as
nucleotide was 21:1 polydA to oligodT). The mixture was heated to boiling in a 1 ml
eppendorf tube, then removed and allowed to cool to room temperature. Assays were
performed in a final volume of 25 μl of 20 mM Tris-Cl (pH 7.5), 8 mM MgCl2, 5 mM
DTT, 0.5 mM EDTA, 40 μg/ml BSA, 4% glycerol, containing 20 μM [α-32P]dTTP
and 0.36 μg polydA-oligodT. Proteins were added to the reaction on ice, then shifted
to 37°C for 5 min. DNA synthesis was quantitated using DE81 paper as described in
Rowen et al., "Primase, the DnaG Protein of Escherichia coli. An Enzyme Which
Starts DNA Chains," J. Biol. Chem., 253:758-764 (1979), which is hereby
incorporated by reference.

Example 4 - Singly Primed M13mp18 ssDNA Replication Assays

M13mp18 was phenol extracted from phage and purified by two
successive bandings (one downward and one upward) in cesium chloride gradients.
M13mp18 ssDNA was singly primed with a DNA 30mer (map position 6817-6846) as
described in Studwell et al., "Processive Replication is Contingent on the Exonuclease
Subunit of DNA Polymerase III Holoenzyme," J. Biol. Chem., 265:1171-1178 (1990),
which is hereby incorporated by reference. Replication assays contained 72 ng of
singly primed M13mp18 ssDNA in a final volume of 25 μl of replication buffer.
Other proteins added to the assay, and their amounts, are indicated in the Brief
Description of the Drawings. Reactions were incubated for 5 min. at 37°C and then
were quenched upon adding an equal volume of 1% SDS and 40 mM EDTA. DNA
synthesis was quantitated using DE81 paper as described in Rowen et al., "Primase,
the DnaG Protein of Escherichia coli. An Enzyme Which Starts DNA Chains," J.
Biol. Chem., 253:758-764 (1979), which is hereby incorporated by reference, and
product analysis was performed in a 0.8% native agarose gel followed by
autoradiography.
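
For orientation, the 72 ng of template per assay can be converted to a molar amount of circles. The following Python sketch is illustrative only and rests on two assumptions that are not stated in this description: a genome length of 7249 nucleotides for M13mp18 and an approximate average mass of 330 g/mol per ssDNA nucleotide. The function names are hypothetical.

```python
M13MP18_LENGTH_NT = 7249        # assumed genome length (nucleotides)
AVG_NT_MASS_G_PER_MOL = 330.0   # assumed average mass of one ssDNA nucleotide


def template_fmol(mass_ng, length_nt=M13MP18_LENGTH_NT,
                  nt_mass=AVG_NT_MASS_G_PER_MOL):
    """Femtomoles of ssDNA circles in a given mass (ng) of template."""
    molar_mass = length_nt * nt_mass          # g/mol for one circle
    moles = mass_ng * 1e-9 / molar_mass       # ng -> g -> mol
    return moles * 1e15                       # mol -> fmol


def template_nM(mass_ng, volume_ul):
    """Molar concentration of circles; fmol per ul is numerically equal to nM."""
    return template_fmol(mass_ng) / volume_ul


if __name__ == "__main__":
    # 72 ng of primed template in a 25 ul reaction.
    print(f"{template_fmol(72.0):.1f} fmol, {template_nM(72.0, 25.0):.2f} nM")
```

Under these assumptions the template is present at roughly 30 fmol per reaction, which is the scale against which amounts of added polymerase, clamp, and clamp loader can be compared.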

Example 5 - Genomic Staphylococcus aureus DNA 

Two strains of S. aureus were used. For PCR of the first fragment of
the dnaX gene sequence, the strain was ATCC 25923. For all other work the strain
was strain 4220 (a gift of Dr. Pat Schlievert, University of Minnesota). This strain
lacks a gene needed for producing toxic shock (Kreiswirth et al., "The Toxic Shock
Syndrome Exotoxin Structural Gene is Not Detectably Transmitted by a Prophage,"
Nature, 305:709-712 (1996) and Balan et al., "Autocrine Regulation of Toxin
Synthesis by Staphylococcus aureus," Proc. Natl. Acad. Sci. USA, 92:1619-1623
(1995), which are hereby incorporated by reference). S. aureus cells were grown
overnight at 37°C in LB containing 0.5% glucose. Cells were collected by
centrifugation (24 g wet weight). Cells were resuspended in 80 ml solution I (50 mM
glucose, 10 mM EDTA, 25 mM Tris-HCl (pH 8.0)). SDS and NaOH were then
added to 1% and 0.2 N, respectively, followed by incubation at 65°C for 30 min. to
lyse the cells. 68.5 ml of 3 M sodium acetate (pH 5.0) was added followed by
centrifugation at 12,000 rpm for 30 min. The supernatant was discarded and the pellet
was washed twice with 50 ml of 6 M urea, 10 mM Tris-HCl (pH 7.5), 1 mM EDTA
using a dounce homogenizer. After each wash, the resuspended pellet was collected
by centrifugation (12,000 rpm for 20 min.). After the second wash, the pellet was
resuspended in 50 ml 10 mM T.E. buffer using a dounce homogenizer and then
incubated for 30 min. at 65°C. The solution was centrifuged at 12,000 rpm for 20
min., and the viscous supernatant was collected. 43.46 g CsCl was added to the 50
ml of supernatant (density between 1.395-1.398) and poured into two 35 ml quick seal
ultracentrifuge tubes (tubes were completely filled using the same density of CsCl in
T.E.). To each tube was added 0.5 ml of a 10 mg/ml stock of ethidium bromide.
Tubes were spun at 55,000 rpm for 18 h at 18°C in a Sorvall TV860 rotor. The band
of genomic DNA was extracted using a syringe and needle. Ethidium bromide was
removed using two butanol extractions, and the DNA was then dialyzed against 4 l of
T.E. at pH 8.0 overnight. The DNA was recovered by ethanol precipitation and then
resuspended in T.E. buffer (1.7 mg total) and stored at -20°C.

Example 6 - Cloning and Purification of S. aureus Pol III-L 

To further characterize the mechanism of DNA replication in S.
aureus, large amounts of its replication proteins were produced through use of the
genes. The polC gene encoding S. aureus Pol III-L (alpha-large) subunit has been
sequenced and expressed in E. coli (Pacitti et al., "Characterization and
Overexpression of the Gene Encoding Staphylococcus aureus DNA Polymerase III,"
Gene, 165:51-56 (1995), which is hereby incorporated by reference). The previous
work utilized a pBS[KS] vector for expression in which the E. coli RNA polymerase
is used for gene transcription. In the earlier study, the S. aureus polC gene was
precisely cloned at the 5' end encoding the N-terminus, but the amount of the gene
that remained past the 3' end was not disclosed and the procedure for subcloning the
gene into the expression vector was only briefly summarized. Furthermore, the
previous study does not show the level of expression of the S. aureus Pol III-L, nor the
amount of S. aureus Pol III-L that is obtained from the induced cells. Since the
previously published procedure could not be repeated and the efficiency of the
expression vector could not be assessed, another strategy outlined below had to be
developed.

The isolated polC gene was cloned into a vector that utilizes T7 RNA
polymerase for transcription as this process generally expresses a large amount of
protein. Hence, the S. aureus polC gene was cloned precisely into the start codon at
the NdeI site downstream of the T7 promoter in a pET vector. As the polC gene
contains an internal NdeI site, the entire gene could not be amplified and placed into
the NdeI site of a pET vector. Hence, a three step cloning strategy that yielded the
desired clone was devised (Figure 1). These attempts were quite frustrating initially
as no products of cloning in standard E. coli strains such as DH5α, a typical
laboratory strain for preparation of DNA, could be obtained. Finally, a cell that was
mutated in several genes affecting DNA stability was useful in obtaining the desired
products of cloning.

In brief, the cloning strategy required use of another expression vector
(called pET1137kDa) in which the 37 kDa subunit of human RFC, the clamp loader
of the human replication system, had been cloned into the pET11 vector. The gene
encoding the 37 kDa subunit contains an internal NsiI site, which was needed for the
precise cloning of the isolated polC gene. This three step strategy is shown in
Figure 1. In the first step, an approximately 2.3 kb section at the 5' end of the gene
(encoding the N-terminus of Pol III-L) was amplified using the polymerase chain
reaction (PCR). Primers were as follows:

Upstream (SEQ. ID. No. 35)

ggtggtaatt gtcttgcata tgacagagc 29

Downstream (SEQ. ID. No. 36)

agcgattaag tggattgccg ggttgtgatg c 31

Amplification was performed using 500 ng genomic DNA, 0.5 mM EDTA, 1 μM of
each primer, 1 mM MgSO4, 2 units vent DNA polymerase (New England Biolabs) in
100 μl of vent buffer (New England Biolabs). Forty cycles were performed using the
following cycling scheme: 94°C, 1 min.; 60°C, 1 min.; 72°C, 2.5 min. The product
was digested with NdeI (underlined in the upstream primer) and NsiI (an internal site
in the product) and the approximately 1.8 kb fragment was gel purified. A pET11
vector containing as an insert the 37 kDa subunit of human replication factor C
(pET1137kDa) was digested with NdeI and NsiI and gel purified. The PCR fragment
was ligated into the digested pET1137kDa vector and the ligation reaction was
transformed into Epicurean coli supercompetent SURE 2 cells (Stratagene) and
colonies were screened for the correct chimera (pET11PolC1) by examining
minipreps for proper length and correct digestion products using NdeI and NsiI. In the
second step, an approximately 2076 bp fragment containing the DNA encoding the
C-terminus of Pol III-L subunit was amplified using the following sequences as primers:

Upstream (SEQ. ID. No. 37)

agcatcacaa cccggcaatc cacttaatcg c 31

Downstream (SEQ. ID. No. 38)

gactacgcca tgggcattaa ataaatacc 29

The amplification cycling scheme was as described above except the elongation step
at 72°C was for 2 min. The product was digested with BamHI (underlined in the
downstream primer) and NsiI (internal to the product) and the approximately 480 bp
product was gel purified and ligated into the pET11PolC1 that had been digested with
NsiI/BamHI and gel purified (ligated product is pET11PolC2). To complete the
expression vector, an approximately 2080 bp PCR product was amplified over the two
NsiI sites internal to the gene using the following primers:

Upstream (SEQ. ID. No. 39)

gaagatgcat ataaacgtgc aagacctagt 30

Downstream (SEQ. ID. No. 40)

gtctgacgca cgaattgtaa agtaagatgc atag

The amplification cycling scheme was as described above except the 72°C elongation
step was 2 min. The PCR product, and the pET11PolC2 vector, were digested with
NsiI and gel purified. The ligation mixture was transformed as described above and
colonies were screened for the correct chimera (pET11PolC).
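
The primer sequences listed above can be checked mechanically for length and for the NdeI and NsiI recognition sites used in the cloning steps. The Python sketch below is an illustration only: the sequences are transcribed from the listing with spacing removed, the dictionary keys and function name are hypothetical, and the recognition sequences (CATATG for NdeI, ATGCAT for NsiI) are the standard ones rather than values quoted in this description. Both sites are palindromic, so scanning the primer strand alone is sufficient.

```python
# Primer sequences as listed in the text (5' to 3'), spacing removed.
PRIMERS = {
    "SEQ35_upstream": "ggtggtaattgtcttgcatatgacagagc",
    "SEQ36_downstream": "agcgattaagtggattgccgggttgtgatgc",
    "SEQ37_upstream": "agcatcacaacccggcaatccacttaatcgc",
    "SEQ38_downstream": "gactacgccatgggcattaaataaatacc",
    "SEQ39_upstream": "gaagatgcatataaacgtgcaagacctagt",
    "SEQ40_downstream": "gtctgacgcacgaattgtaaagtaagatgcatag",
}

# Standard recognition sequences (lower case to match the listing).
SITES = {"NdeI": "catatg", "NsiI": "atgcat"}


def find_sites(seq, sites=SITES):
    """Map each enzyme to the 0-based start positions of its site in seq."""
    hits = {}
    for enzyme, motif in sites.items():
        positions = []
        start = seq.find(motif)
        while start != -1:
            positions.append(start)
            start = seq.find(motif, start + 1)  # allow overlapping matches
        if positions:
            hits[enzyme] = positions
    return hits


if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), find_sites(seq))
```

A check of this kind is a quick way to confirm that a primer carries the intended cloning site before ordering it; it does not replace verification of the digestion products on a gel.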

To express Pol III-L polymerase, the pET11PolC plasmid was
transformed into E. coli strain BL21(DE3). 24 L of E. coli BL21(DE3)pET11PolC
were grown in LB media containing 50 µg/ml ampicillin at 37°C to an OD of 0.7 and
then the temperature was lowered to 15°C. Cells were then induced for Pol III-L
expression upon addition of 1 mM IPTG to produce the T7 RNA polymerase needed
to transcribe polC. This step was followed by further incubation at 15°C for 18 h.
Expression of S. aureus Pol III-L polymerase was so high that it could easily be
visualized by Coomassie staining of an SDS polyacrylamide gel of whole cells
(Figure 2A). The expressed protein migrated in the SDS polyacrylamide gel in a
position expected for a 165 kDa polypeptide. In this procedure, it is important that
cells are induced at 15°C, as induction at 37°C produces a truncated version of
Pol III-L polymerase, of approximately 130 kDa.

Cells were collected by centrifugation at 5°C. Cells (12 g wet weight)
were stored at -70°C. The following steps were performed at 4°C. Cells were thawed
and lysed in cell lysis buffer as described (final volume = 50 ml) and were passed
through a French Press (Aminco) at a minimum of 20,000 psi. PMSF (2 mM) was
added to the lysate as the lysate was collected from the French Press. DNA was
removed and the lysate was clarified by centrifugation. The supernatant was dialyzed
for 1 h against Buffer A containing 50 mM NaCl. The final conductivity was
equivalent to 190 mM NaCl. Supernatant (24 ml, 208 mg) was diluted to 50 ml using
Buffer A to bring the conductivity to 96 mM MgCl2, and then was loaded onto an 8
ml MonoQ column equilibrated in Buffer A containing 50 mM NaCl. The column
was eluted with a 160 ml linear gradient of Buffer A from 50 mM NaCl to 500 mM
NaCl. Seventy-five fractions (1.3 ml each) were collected (Figure 2B). Aliquots were
analyzed for their ability to synthesize DNA, and 20 µl of each fraction was analyzed
by Coomassie staining of an SDS polyacrylamide gel. Based on the DNA synthetic
capability, and the correct size band in the gel, fractions 56-65 containing Pol III-L
polymerase were pooled (22 ml, 31 mg). The pooled fractions were dialyzed
overnight at 4°C against 50 mM phosphate (pH 7.6), 5 mM DTT, 0.1 mM EDTA, 2
mM PMSF, and 20 % glycerol (P-cell buffer). The dialyzed pool was loaded onto a
4.5 ml phosphocellulose column equilibrated in P-cell buffer, and then eluted with a
25 ml linear gradient of P-cell buffer from 0 M NaCl to 0.5 M NaCl. Fractions of 1
ml were collected and analyzed in an SDS polyacrylamide gel stained with Coomassie
Blue (Figure 2C). Fractions 20-36 contained the majority of the Pol III-large at a
purity of greater than 90 % (5 mg).
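
The protein amounts reported in the purification above can be tabulated as step yields. The following is a minimal sketch in Python and is not part of the original procedure; the function name `step_yields` and the layout are illustrative assumptions. The masses are the total protein values stated in the text, so the percentages describe total protein carried forward, not recovery of polymerase activity.

```python
def step_yields(steps):
    """Return (name, mg, percent of the first step) for each purification step."""
    start_mg = steps[0][1]
    return [(name, mg, round(100.0 * mg / start_mg, 1)) for name, mg in steps]

# Total protein (mg) as reported in the text for recombinant Pol III-L.
pol3l_steps = [
    ("clarified supernatant", 208.0),
    ("MonoQ pool (fractions 56-65)", 31.0),
    ("phosphocellulose pool (fractions 20-36)", 5.0),
]

for name, mg, pct in step_yields(pol3l_steps):
    print(f"{name:42s} {mg:6.1f} mg  {pct:5.1f} %")
```

Because the target protein becomes a larger share of each pool as contaminants are removed, a falling total protein percentage alongside rising purity is the expected pattern.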



Example 7 - S. aureus Pol III-L is Not Processive on its Own

The Pol III-L polymerase purifies from B. subtilis as a single subunit
without accessory factors (Barnes et al., "Purification of DNA Polymerase III of
Gram-positive Bacteria," Methods in Enzy., 262:35-42 (1995), which is hereby
incorporated by reference). Hence, it seemed possible that it may be a Type I
replicase (e.g., like T5 polymerase) and, thus, be capable of extending a single primer
full length around a long singly primed template. To perform this experiment, a
template M13mp18 ssDNA primed with a single DNA oligonucleotide either in the
presence or absence of SSB was used. DNA products were analyzed in a neutral
agarose gel which resolved products by size. The results showed that Pol III-L
polymerase was incapable of extending the primer around the DNA (to form a
completed duplex circle referred to as replicative form II ("RFII")) whether SSB was
present or not. This experiment has been repeated using more enzyme and longer
times, but no full length RFII products are produced. Hence, Pol III-L would appear
not to follow the paradigm of the T5 system (Type I replicase) in which the
polymerase is efficient in synthesis in the absence of any other protein(s).

Example 8 - Cloning and Purification of S. aureus Beta Subunit 

The sequence of an S. aureus homolog of the E. coli dnaN gene
(encoding the beta subunit) was obtained in a study in which the large recF region of
DNA was sequenced (Alonso et al., "Nucleotide Sequence of the recF Gene Cluster
From Staphylococcus aureus and Complementation Analysis in Bacillus subtilis recF
Mutants," Mol. Gen. Genet., 246:680-686 (1995), Alonso et al., "Nucleotide
Sequence of the recF Gene Cluster From Staphylococcus aureus and
Complementation Analysis in Bacillus subtilis recF Mutants," Mol. Gen. Genet.,
248:635-636 (1995), which are hereby incorporated by reference). Sequence
alignment of the S. aureus beta and E. coli beta shows approximately 30% identity.
Overall this level of homology is low and makes it uncertain that S. aureus beta will
have the same shape and function as the E. coli beta subunit.

To obtain S. aureus beta protein, the dnaN gene was isolated and
precisely cloned into a pET vector for expression in E. coli. S. aureus genomic DNA
was used as template to amplify the homolog of the dnaN gene (encoding the putative
beta). The upstream and downstream primers were designed to isolate the dnaN gene
by PCR amplification from genomic DNA. Primers were:

Upstream (SEQ. ID. No. 41)

cgactggaag gagttttaac atatg atgga attcac 36

Downstream (SEQ. ID. No. 42)

ttatat ggat ccttagtaag ttctgattgg 30

The NdeI site used for cloning into pET16b (Novagen) is underlined in the Upstream
primer and the BamHI site used for cloning into pET16b is underlined in the
Downstream primer. The NdeI and BamHI sites were used for directional cloning
into pET16b (Figure 3). Amplification was performed using 500 ng genomic DNA, 0.5
mM dNTPs, 1 µM of each primer, 1 mM MgSO4, 2 units Vent DNA polymerase in 100
µl of Vent buffer. Forty cycles were performed using the following cycling scheme:
94°C, 1 min; 60°C, 1 min; 72°C, 1 min 10 s. The 1167 bp product was digested with
NdeI and BamHI and purified in a 0.7 % agarose gel. The pure digested fragment was
ligated into the pET16b vector which had been digested with NdeI and BamHI and gel
purified in a 0.7 % agarose gel. Ligated products were transformed into E. coli
competent SURE II cells (Stratagene) and colonies were screened for the correct
chimera by examining minipreps for proper length and correct digestion products
using NdeI and BamHI.
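
The placement of the cloning sites in the two dnaN primers can be checked with a short script. This is a minimal sketch in Python, not part of the original disclosure; the function name `find_sites` is an illustrative assumption. The recognition sequences used are the standard ones for NdeI (CATATG) and BamHI (GGATCC), and the primer strings are copied from SEQ. ID. Nos. 41 and 42 as printed above.

```python
# Recognition sequences (5'->3') for the two enzymes named in the text.
SITES = {"NdeI": "catatg", "BamHI": "ggatcc"}

def find_sites(primer, sites=SITES):
    """Return {enzyme: [0-based start positions]} for each site found in primer."""
    seq = primer.replace(" ", "").lower()
    hits = {}
    for enzyme, motif in sites.items():
        positions = [i for i in range(len(seq) - len(motif) + 1)
                     if seq[i:i + len(motif)] == motif]
        if positions:
            hits[enzyme] = positions
    return hits

upstream = "cgactggaag gagttttaac atatg atgga attcac"   # SEQ. ID. No. 41
downstream = "ttatat ggat ccttagtaag ttctgattgg"         # SEQ. ID. No. 42

print(find_sites(upstream))
print(find_sites(downstream))
```

Each primer carries exactly one of the two sites, which is what makes the NdeI/BamHI cloning directional.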

24 L of BL21(DE3)pETbeta cells were grown in LB containing 50
µg/ml ampicillin at 37°C to an O.D. of 0.7, and, then, the temperature was lowered to
15°C. IPTG was added to a concentration of 2 mM, and the culture was incubated for
a further 18 h at 15°C to induce expression of S. aureus beta (Figure 4A). It is
interesting to note that the beta subunit, when induced at 37°C, was completely
insoluble. However, induction of cells at 15°C provided strong expression of beta
and, upon cell lysis, over 50% of the beta was present in the soluble fraction.

Cells were harvested by centrifugation (44 g wet weight) and stored at
-70°C. The following steps were performed at 4°C. Cells (44 g wet weight) were
thawed and resuspended in 45 ml 1X binding buffer (5 mM imidazole, 0.5 M NaCl, 20
mM Tris-HCl (final pH 7.5)) using a Dounce homogenizer. Cells were lysed using a
French Pressure cell (Aminco) at 20,000 psi, and then 4.5 ml of 10 % polyamine P
(Sigma) was added. Cell debris and DNA were removed by centrifugation at 13,000
rpm for 30 min. at 4°C. The pET16beta vector places a 20 residue leader containing
10 histidine residues at the N-terminus of beta. Hence, upon lysing the cells, the
S. aureus beta was greatly purified by chromatography on a nickel chelate resin
(Figure 4B). The supernatant (890 mg protein) was applied to a 10 ml HiTrap
Chelating Sepharose column (Pharmacia-LKB) equilibrated in binding buffer. The
column was washed with binding buffer, then eluted with a 100 ml linear gradient of
60 mM imidazole to 1 M imidazole in binding buffer. Fractions of 1.35 ml were
collected. Fractions were analyzed for the presence of beta in an SDS polyacrylamide
gel stained with Coomassie Blue. Fractions 28-52, containing most of the beta
subunit, were pooled (35 ml, 82 mg). Remaining contaminating protein was removed
by chromatography on MonoQ. The S. aureus beta becomes insoluble as the ionic
strength is lowered and, thus, the pool of beta was dialyzed overnight against Buffer A
containing 400 mM NaCl. The dialyzed pool became slightly turbid, indicating it was
at its solubility limit at these concentrations of protein and NaCl. The insoluble
material was removed by centrifugation (64 mg remaining) and the supernatant was
then diluted 2-fold with Buffer A to bring the conductivity to 256. The protein was
then applied to an 8 ml MonoQ column equilibrated in Buffer A plus 250 mM NaCl
and then eluted with a 100 ml linear gradient of Buffer A from 0.25 M NaCl to
0.75 M NaCl; fractions of 1.25 ml were collected (Figure 4C). Under these
conditions, approximately 27 mg of
the beta flowed through the column and the remainder eluted in fractions 1-18
(24 mg).

Example 9 - The S. aureus Beta Subunit Protein Stimulates S. aureus Pol III-L
and E. coli Core

The experiment of Figure 5A tests the ability of S. aureus beta to
stimulate S. aureus Pol III-L on a linear polydA-oligodT template. Reactions are also
performed with E. coli beta and Pol III core. The linear template was polydA of
average length of 4500 nucleotides primed with a 30mer oligonucleotide of T
residues. The first two lanes show the activity of Pol III-L either without (lane 1) or
with S. aureus beta (lane 2). The result shows that the S. aureus beta stimulates Pol
III-L approximately 5-6 fold. Lanes 5 and 6 show the corresponding experiment using
E. coli core with (lane 6) or without (lane 5) E. coli beta. The core is stimulated over
10-fold by the E. coli beta subunit under the conditions used.

Although Gram positive and Gram negative cells diverged from one
another long ago and components of one polymerase machinery would not be
expected to be interchangeable, it was decided to test the activity of the S. aureus beta
with E. coli Pol III core. Lanes 3 and 4 show that the S. aureus beta also stimulates
E. coli core about 5-fold. This result can be explained by an interaction between the
clamp and the polymerase that has been conserved during the evolutionary divergence
of Gram positive and Gram negative cells. A chemical inhibitor that would disrupt this
interaction would be predicted to have a broad spectrum of antibiotic activity, shutting
down replication in Gram negative and Gram positive cells alike. This assay, and
others based on this interaction, can be devised to screen chemicals for such
inhibition. Further, since all the proteins in this assay are highly overexpressed
through recombinant techniques, sufficient quantities of the protein reagents can be
obtained for screening hundreds of thousands of compounds.

In summary, the results show that S. aureus beta, produced in E. coli, is
indeed an active protein (i.e., it stimulates polymerase activity). Furthermore, the
results show that Pol III-L functions with a second protein (i.e., S. aureus beta).
Before this experiment, there was no assurance that Pol III-L, which is significantly
different in structure from E. coli alpha, would function with another protein. For
example, unlike E. coli alpha, which copurifies with several accessory proteins, Pol
III-L purified from B. subtilis as a single protein with no other subunits attached
(Barnes et al., "Purification of DNA Polymerase III of Gram-positive Bacteria,"
Methods in Enzy., 262:35-42 (1995), which is hereby incorporated by reference).
Finally, if one were to assume that S. aureus beta would function with a polymerase,
the logical candidate would have been the product of the dnaE gene (alpha-small)
instead of polC (Pol III-L) since the dnaE product is more homologous to E. coli alpha
subunit than Pol III-L.

Example 10 - The S. aureus Beta Subunit Behaves as a Circular Sliding Clamp

The ability of S. aureus beta to stimulate Pol III-L could be explained
by formation of a 2-protein complex between Pol III-L and beta to form a processive
replicase similar to the Type II class (e.g., T7 type). Alternatively, the S. aureus
replicase could be organized as a Type III replicase, which operates with a circular
sliding clamp and a clamp loader. In this case, the S. aureus beta would be a circular
protein and would require a clamp loading apparatus to load it onto DNA. The ability
of the beta subunit to stimulate Pol III-L in Figure 5A could be explained by the fact
that the polydA-oligodT template is a linear DNA and a circular protein could thread
itself onto the DNA over an end. Such "end threading" has been observed with PCNA
and explains its ability to stimulate DNA polymerase delta in the absence of the RFC
clamp loader (Burgers et al., "ATP-Independent Loading of the Proliferating Cell
Nuclear Antigen Requires DNA Ends," J. Biol. Chem., 268:19923-19926 (1993),
which is hereby incorporated by reference).

To distinguish between these possibilities, S. aureus beta was
examined for ability to stimulate Pol III-L on a circular primed template. In
Figure 5B, assays were performed using circular M13mp18 ssDNA coated with
E. coli SSB and primed with a single oligonucleotide to test the activity of beta on
circular DNA. Lane 1 shows the extent of DNA synthesis using Pol III-L alone. In
lane 2, Pol III-L was supplemented with S. aureus beta. The S. aureus beta did not
stimulate the activity of Pol III-L on this circular DNA (nor in the absence of SSB).
Inability of S. aureus beta to stimulate Pol III-L is supported by the results of Figure 6,
lane 1 that analyzes the product of Pol III-L action on the circular DNA in an agarose
gel in the presence of S. aureus beta. In summary, these results show that S. aureus
beta only stimulates Pol III-L on linear DNA, not circular DNA. Hence, the S. aureus
beta subunit behaves as a circular protein.

Lane 3 shows the result of adding both S. aureus beta and E. coli
gamma complex to Pol III-L. Again, no stimulation was observed (compare with lane
1). This result indicates that the functional contacts between the clamp and clamp
loader were not conserved during evolution of Gram positive and Gram negative cells.

Controls for these reactions on circular DNA are shown for the E. coli
system in Lanes 4-6. Addition of only beta to E. coli Pol III core did not result in
stimulating the polymerase (compare lanes 4 and 5). However, when clamp loader
complex was included with beta and core, a large stimulation of synthesis was
observed (lane 6). In summary, stimulation of synthesis is only observed when both
beta and clamp loader complex were present, consistent with inability of the circular
beta ring to assemble onto circular DNA by itself.

Example 11 - Pol III-L Functions as a Pol III-Type Replicase with Beta and a
Clamp Loader Complex to Become Processive

Next, it was determined whether S. aureus Pol III-L requires two
components (a beta clamp and a clamp loader) to extend a primer full length around a
circular primed template. In Figure 6, a template circular M13mp18 ssDNA primed
with a single DNA oligonucleotide was used. DNA products were analyzed in a
neutral agarose gel which resolves starting materials (labeled ssDNA in Figure 6)
from completed duplex circles (labeled RFII for replicative form II). The first two
lanes show, as demonstrated in other examples, that Pol III-L is incapable of
extending the primer around the circular DNA in the presence of only S. aureus beta.
In lane 4 of Figure 6, E. coli clamp loader complex (also known as gamma complex)
and beta subunit were mixed with S. aureus Pol III-L in the assay containing singly
primed M13mp18 ssDNA coated with SSB. If the beta clamp, assembled on DNA by
clamp loader complex, provides processivity to S. aureus Pol III-L, the ssDNA circle
should be converted into a fully duplex circle (RFII) which would be visible in an
agarose gel analysis. The results of the experiment showed that the E. coli beta and
clamp loader complex did indeed provide Pol III-L with ability to fully extend the
primer around the circular DNA to form the RFII (lane 4). The negative control using
only E. coli clamp loader complex and beta is shown in lane 3. For comparison, lane
6 shows the result of mixing the three components of the E. coli system (Pol III core,
beta, and clamp loader complex). This reaction gives almost exclusively full length
RFII product. The qualitatively different product profile that Pol III-L gives in the
agarose gel analysis compared to E. coli Pol III core with beta and clamp loader
complex shows that the products observed using Pol III-L are not due to a contaminant
of E. coli Pol III core in the S. aureus Pol III-L preparation (compare lanes 4 and 6).

It is generally thought that the polymerase of one system is specific for
its SSB. However, these reactions are performed on ssDNA coated with the E. coli
SSB protein. Hence, the S. aureus Pol III-L appears capable of utilizing E. coli SSB
and the E. coli beta. It would appear that the only component that is not
interchangeable between the Gram positive and Gram negative systems is the clamp
loader complex.

Thus, the S. aureus Pol III-L functions as a Pol III type replicase with
the E. coli beta clamp assembled onto DNA by a clamp loader complex.

Example 12 - Purification of Two DNA Polymerase III-Type Enzymes From
S. aureus Cells

The MonoQ resin by Pharmacia has very high resolution which would
resolve the three DNA polymerases of S. aureus. Hence, S. aureus cells were lysed,
DNA was removed from the lysate, and the clarified lysate was applied onto a MonoQ
column. The details of this procedure are: 300 L of S. aureus (strain 4220, a gift of
Dr. Pat Schlievert, University of Minnesota) was grown in 2X LB media at 37°C to an
O.D. of approximately 1.5 and the cells were then collected by centrifugation.
Approximately 2 kg of wet cell paste was obtained and stored at -70°C. 122 g of cell
paste was thawed and resuspended in 192 ml of cell lysis buffer followed by passage
through a French Press cell (Aminco) at 40,000 psi. The resultant lysate was clarified
by high speed centrifugation (1.3 g protein in 120 ml). A 20 ml aliquot of the
supernatant was dialyzed 2 h against 2 L of Buffer A containing 50 mM NaCl. The
dialyzed material (148 mg, conductivity = 101 mM NaCl) was diluted 2-fold with
Buffer A containing 50 mM NaCl and then loaded onto an 8 ml MonoQ column
equilibrated in Buffer A containing 50 mM NaCl. The column was washed with
Buffer A containing 50 mM NaCl, and then eluted with a 160 ml linear gradient of
0.05 M NaCl to 0.5 M NaCl in Buffer A. Fractions of 2.5 ml (64 total) were collected
and analyzed in an SDS polyacrylamide gel and for their replication activity in
assays using calf thymus
DNA.

Three peaks of DNA polymerase activity were identified (Figure 7).
Previous studies of cell extracts prepared from the Gram positive organism Bacillus
subtilis identified only two peaks of activity off a DEAE column (similar charged
resin to MonoQ). The first peak was Pol II, and the second peak was a combination of
DNA polymerases I and III. The DNA polymerases I and III were then separated on a
subsequent phosphocellulose column. The middle peak in Figure 7 is much larger
than the other two peaks and, thus, it was decided to chromatograph this peak on a
phosphocellulose column. The second peak of DNA synthetic activity was pooled
(fractions 37-43; 28 mg in 14 ml) and dialyzed against 1.5 L P-cell buffer for 2.5 h.
Then, the sample (ionic strength equal to 99 mM NaCl) was applied to a 5 ml
phosphocellulose column equilibrated in P-cell buffer. After washing the column in
10 ml P-cell buffer, the column was eluted with a 60 ml gradient of 0 - 0.5 M NaCl in
P-cell buffer. Seventy fractions were collected and then analyzed for DNA synthesis
using calf thymus DNA as template. This column resolved the polymerase activity
into two distinct peaks (Figure 7B).
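
The nominal salt concentration at which a MonoQ fraction elutes can be estimated from the gradient geometry given above (160 ml linear gradient, 0.05 M to 0.5 M NaCl, 64 fractions of 2.5 ml). The following is a minimal sketch in Python under two assumptions that the text does not state: fraction 1 begins exactly at the start of the gradient, and there is no dead volume or mixing delay between the gradient former and the fraction collector. The function names are illustrative.

```python
def nacl_at_volume(volume_ml, start_mM=50.0, end_mM=500.0, gradient_ml=160.0):
    """NaCl (mM) delivered after volume_ml of a linear gradient."""
    if not 0.0 <= volume_ml <= gradient_ml:
        raise ValueError("volume lies outside the gradient")
    return start_mM + (end_mM - start_mM) * volume_ml / gradient_ml

def fraction_window(first, last, fraction_ml=2.5):
    """Nominal NaCl range (mM) spanned by fractions first..last (1-based, inclusive)."""
    return (nacl_at_volume((first - 1) * fraction_ml),
            nacl_at_volume(last * fraction_ml))

# Pooled middle peak of DNA synthetic activity (fractions 37-43).
low, high = fraction_window(37, 43)
print(f"fractions 37-43: about {low:.0f} to {high:.0f} mM NaCl (nominal)")
```

Under these assumptions the pooled peak corresponds to roughly 0.30 to 0.35 M NaCl; measured conductivity of the fractions would be the more reliable figure.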

Hence, there appear to be four DNA polymerases in Staphylococcus
aureus. They were designated here as peak 1 (first peak off MonoQ), peak 2 (first
peak off phosphocellulose), peak 3 (second peak off phosphocellulose), and peak 4
(last peak off MonoQ) (see Figure 7). Peak 4 was presumably Pol III-L, as it elutes
from MonoQ in a similar position as the Pol III-L expressed in E. coli (compare
Figure 7A with Figure 2).

Example 13 - Demonstration That Peak 1 (Pol III-2) Functions as a Pol III-Type
Replicase With E. coli Beta Assembled on DNA by E. coli Clamp
Loader Complex.

To test which peak contained a Pol III-type of polymerase, an assay
was used in which the E. coli clamp loader complex and beta support formation of full
length RFII product starting from E. coli SSB coated circular M13mp18 ssDNA
primed with a single oligonucleotide. In Figure 8, both Peaks 1 and 4 are stimulated
by the E. coli clamp loader complex and beta subunit and, in fact, Peaks 2 and 3 are
inhibited by these proteins (the quantitation is shown below the gel in the figure).
Further, the product analysis in the agarose gel shows full length RFII duplex DNA
circles only for peaks 1 and 4. These results, combined with the NEM, pCMB, and
KCl characteristics in Tables 2 and 3 below, suggest that there are two Pol III-type
DNA polymerases in S. aureus and that these are partially purified in peaks 1 and 4.

Next, it was determined which of these peaks of DNA polymerase
activity correspond to DNA polymerases I, II, and III, and which peak is the
unidentified DNA polymerase. In the Gram positive bacterium B. subtilis, Pol III is
inhibited by pCMB, NEM, and 0.15 M NaCl, Pol II is inhibited by pCMB, but not NEM
or 0.15 M KCl, and Pol I is not inhibited by any of these treatments (Gass et al.,
"Further Genetic and Enzymological Characterization of the Three Bacillus subtilis
Deoxyribonucleic Acid Polymerases," J. Biol. Chem., 248:7688-7700 (1973), which
is hereby incorporated by reference). Hence, assays were performed in the presence or
absence of pCMB, NEM, and 0.15 M KCl (see Tables 2 and 3 below). Peak 3 clearly
corresponded to Pol I, because it was not inhibited by NEM, pCMB, or 0.15 M NaCl.
Peak 2 corresponds to Pol II, because it was not inhibited by NEM, but was inhibited
by pCMB and 0.15 M NaCl. Peaks 1 and 4 both had characteristics that mimic Pol
III; however, peak 4 elutes on MonoQ at a similar position as Pol III-L expressed in
E. coli (see Figure 2B). Hence, peak 4 is likely Pol III-L, and peak 1 is likely the
unknown polymerase.



Table 2: Expected Characteristics of Polymerases

Polymerase    pCMB              NEM              0.15 M KCl
Pol I         not inhibited*    not inhibited    not inhibited
Pol II        inhibited**       not inhibited    not inhibited
Pol III-L     inhibited         inhibited        not inhibited

* Not inhibited is defined as greater than 75% remaining activity
** Inhibited is defined as less than 40% remaining activity




Table 3: Observed Characteristics

Peak      pCMB             NEM              0.15 M KCl    Assignment
Peak 1    inhibited        inhibited                      new polymerase
Peak 2    inhibited        not inhibited                  Pol II
Peak 3    not inhibited    not inhibited                  Pol I
Peak 4    inhibited        inhibited                      Pol III-L
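
The assignment logic behind Tables 2 and 3 can be written out explicitly. The following is a minimal sketch in Python, not part of the original disclosure. It applies the thresholds stated under Table 2 (greater than 75% remaining activity is "not inhibited", less than 40% is "inhibited") to the pCMB and NEM columns only. The function names, the "indeterminate" category for values between the two thresholds, and the "-like" labels are illustrative assumptions, and the percentages in the usage lines are hypothetical inputs, not measured data.

```python
def call(remaining_pct):
    """Apply the Table 2 thresholds to percent remaining activity."""
    if remaining_pct > 75:
        return "not inhibited"
    if remaining_pct < 40:
        return "inhibited"
    return "indeterminate"  # between thresholds; not defined in the text

def assign(pcmb_pct, nem_pct):
    """Assign a polymerase class from pCMB and NEM sensitivity."""
    profile = (call(pcmb_pct), call(nem_pct))
    classes = {
        ("not inhibited", "not inhibited"): "Pol I-like",
        ("inhibited", "not inhibited"): "Pol II-like",
        ("inhibited", "inhibited"): "Pol III-like",
    }
    return classes.get(profile, "unassigned")

# Hypothetical remaining-activity values (percent), for illustration only.
print(assign(pcmb_pct=90, nem_pct=85))
print(assign(pcmb_pct=10, nem_pct=90))
print(assign(pcmb_pct=10, nem_pct=5))
```

Two peaks with a Pol III-like profile cannot be separated by this test alone, which is why the text relies on the MonoQ elution position to distinguish peak 4 from peak 1.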



Example 14 - Identification and Cloning of S. aureus dnaE

This invention describes the finding of two DNA polymerases that
function with a sliding clamp assembled onto DNA by a clamp loader. One of these
DNA polymerases is likely Pol III-L, but the other has not been identified previously.
Presumably, the chromatographic resins used in earlier studies did not have the
resolving power to separate the enzyme from other polymerases. This would be
compounded by the low activity of Pol III-2. To identify a gene encoding the second
Pol III, the amino acid sequences of the Pol III alpha subunit of Escherichia coli,
Salmonella typhimurium, Vibrio cholerae, Haemophilus influenzae, and Helicobacter
pylori were aligned using Clustal W (1.5). Two regions about 400 residues apart were
conserved and primers were designed for the following amino acid sequences:

Upstream, corresponding in E. coli to residues 385-399 (SEQ. ID. No. 43)

Leu Leu Phe Glu Arg Phe Leu Asn Pro Glu Arg Val Ser Met Pro
1               5                   10                  15

Downstream, corresponding in E. coli to residues 750-764 (SEQ. ID. No. 44)

Lys Phe Ala Gly Tyr Gly Phe Asn Lys Ser His Ser Ala Ala Tyr
1               5                   10                  15

The following primers were designed to these two peptide regions using codon
preferences for S. aureus:

Upstream (SEQ. ID. No. 45)

cttctttttg aaagatttct aaataaagaa cgttattcaa tgcc 44

Downstream (SEQ. ID. No. 46)

ataagctgca gcatgacttt tattaaaacc ataacctgca aattt 45
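
One way to sanity-check primers designed from peptide sequences is to translate them back with the standard genetic code and compare the result with the peptide they are meant to encode. The following is a minimal sketch in Python, not part of the original disclosure; the helper names `translate` and `reverse_complement` are illustrative. The downstream primer is reverse-complemented first on the assumption that it anneals to the coding strand. Positions where a translation differs from the listed E. coli peptide are worth checking against the original sequence listing, since they may reflect either a deliberate design choice or a transcription error in a given copy.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "tcag"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {a + b + c: AMINO[16 * i + 4 * j + k]
               for i, a in enumerate(BASES)
               for j, b in enumerate(BASES)
               for k, c in enumerate(BASES)}

def clean(dna):
    """Drop the spacing used in the printed sequence and lowercase it."""
    return dna.replace(" ", "").lower()

def translate(dna):
    """Translate complete codons in frame 1; trailing bases are ignored."""
    seq = clean(dna)
    usable = len(seq) - len(seq) % 3
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

def reverse_complement(dna):
    pairs = {"a": "t", "t": "a", "g": "c", "c": "g"}
    return "".join(pairs[base] for base in reversed(clean(dna)))

upstream = "cttctttttg aaagatttct aaataaagaa cgttattcaa tgcc"      # SEQ. ID. No. 45
downstream = "ataagctgca gcatgacttt tattaaaacc ataacctgca aattt"   # SEQ. ID. No. 46

print(translate(upstream))
print(translate(reverse_complement(downstream)))
```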

Amplification was performed using 2.5 units of Taq DNA Polymerase (Gibco, BRL),
100 ng S. aureus genomic DNA, 1 mM of each of the four dNTPs, 1 µM of each
primer, and 3 mM MgCl2 in 100 µl of Taq buffer. Thirty-five cycles of the following
scheme were repeated: 94°C, 1 min; 55°C, 1 min; 72°C, 90 sec. The PCR product
(approximately 1.1 kb) was electrophoresed in a 0.8 % agarose gel and purified using
a Geneclean III kit (Bio 101). The product was then divided equally into ten separate
aliquots and used as a template for PCR reactions, according to the above protocol, to
reamplify the fragment for sequencing. The final PCR product was purified using a
Qiagen Qiaquick PCR Purification kit, quantitated via optical density at 260 nm,
and sequenced by the Protein/DNA Technology Center at Rockefeller University. The
same primers used for PCR were used to prime the sequencing reactions.
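
The reported product size of approximately 1.1 kb can be compared with what the E. coli residue coordinates of the two conserved regions predict. The following is a minimal sketch in Python, not part of the original disclosure; the function name is illustrative, and the estimate assumes that the S. aureus gene has no insertions or deletions relative to E. coli between the two primer regions.

```python
def expected_amplicon_bp(first_residue, last_residue):
    """Coding length in bp spanning first_residue..last_residue inclusive."""
    if last_residue < first_residue:
        raise ValueError("last_residue must not precede first_residue")
    return 3 * (last_residue - first_residue + 1)

# Upstream region starts at E. coli residue 385, downstream region ends at 764.
print(expected_amplicon_bp(385, 764))
```

The estimate of 1140 bp agrees with the product of approximately 1.1 kb described above.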

Next, the following additional PCR primers were designed to obtain
more sequence information 3' to the first amplified section.

Upstream (SEQ. ID. No. 47)

agttaaaaat gccatatttt gacgtgtttt agttctaat 39

Downstream (SEQ. ID. No. 48)

cttgcaaaag cggttgctaa agatgttgga cgaattatgg gg 42

These primers were used in a PCR reaction using 2.5 units of Taq DNA Polymerase
(Gibco, BRL) with 100 ng S. aureus genomic DNA as a template, 1 mM dNTPs,
1 µM of each primer, and 3 mM MgCl2 in 100 µl of Taq buffer. Thirty-five cycles of
the following scheme were repeated: 94°C, 1 min; 55°C, 1 min; 72°C, 2 min 30
seconds. The 1.6 kb product was then divided into 5 aliquots, and used as a template
in a set of 5 PCR reactions, as described above, to amplify the product for sequencing.
The products of these reactions were purified using a Qiagen Qiaquick PCR
Purification kit, quantitated via optical density at 260 nm, and sequenced by the
Protein/DNA Technology Center at Rockefeller University. The sequence of this
product yielded about 740 bp of new sequence 3' of the first sequence.

As this gene shows better homology to the Gram negative Pol III alpha
subunit compared to Gram positive Pol III-L, it will be designated the dnaE gene.



Example 15 - Identification and Cloning of S. aureus dnaX

The fact that the S. aureus beta stimulates Pol III-L and has a ring
shape suggests that the Gram positive replication machinery is of the three component
type. This implies the presence of a clamp loader complex. This is not a simple
determination to make as the B. subtilis genome shows homologs to only two of the
five subunits of the E. coli clamp loader (dnaX encoding gamma, and holB encoding
delta prime). On the basis of the experiments in this application, which suggest that
there is a clamp loader, it was believed that these two subunit homologues are part of
the clamp loader for the S. aureus beta.

As a start in obtaining the clamp loading apparatus, a strategy was
devised to obtain the gene encoding the tau subunit of S. aureus. In E. coli, the tau
and gamma subunits are derived from the same gene. Tau is the full length product,
and gamma is about 2/3 the length of tau. Gamma is derived from the dnaX gene by
what was originally believed to be an efficient translational frameshift mechanism
that, after it occurs, incorporates only one unique C-terminal residue before
encountering a stop codon. To identify the dnaX gene of S. aureus by PCR analysis,
the dnaX genes of B. subtilis, E. coli, and H. influenzae were aligned. Upon
comparison of the amino acid sequence encoded by these dnaX genes, two areas of
high homology were used to predict the amino acid sequence of the S. aureus dnaX
gene product. PCR primers were designed to these sequences, and a PCR product of
the expected size was indeed produced. DNA primers were designed to two regions
of high similarity for use in PCR that were about 100 residues apart. The amino acid
sequences of these regions were:

Upstream, corresponding to residues 39-48 of E. coli (SEQ. ID. No. 49)

His Ala Tyr Leu Phe Ser Gly Pro Arg Gly
1               5                   10

Downstream, corresponding to residues 138-148 of E. coli (SEQ. ID. No. 50)

His Ala Tyr Leu Phe Ser Gly Pro Arg Gly
1               5                   10

The DNA sequence of the PCR primers was based upon the codon usage of S. aureus.
The primers are as follows:



Upstream (SEQ. ID. No. 51)

cgc ggatcc c atgcatattt attttcaggt ccaagagg 38

Downstream (SEQ. ID. No. 52)

ccg gaattc t ggtggttctt ctaatgtttt taataatgc 39

The first 9 nucleotides of the upstream primer (SEQ. ID. No. 51) contain a BamHI
site, which is underlined, and do not correspond to amino acid codons; the 3' 29
nucleotides correspond to the amino acid sequence of SEQ. ID. No. 49. The EcoRI
site of the downstream primer (SEQ. ID. No. 52) is underlined and the 3' 33
nucleotides correspond to the amino acid sequence of SEQ. ID. No. 50.

The expected PCR product, based on the alignment, is approximately
268 bp between the primer sequences. Amplification was performed using 500 ng
genomic DNA, 0.5 mM dNTPs, 1 µM of each primer, 1 mM MgSO4, 2 units Vent
DNA polymerase in 100 µl of Vent buffer. Forty cycles were performed using the
following cycling scheme: 94°C, 1 min; 60°C, 1 min; 72°C, 30 s. The approximately
300 bp product was digested with EcoRI and BamHI and purified in a 0.7 % agarose
gel. The pure digested fragment was ligated into pUC18 which had been digested
with EcoRI and BamHI and gel purified in a 0.7 % agarose gel. Ligated products
were transformed into E. coli competent DH5α cells (Stratagene), and colonies were
screened for the correct chimera by examining minipreps for proper length and correct
digestion products using EcoRI and BamHI. The sequence of the insert was
determined and was found to have high homology to the dnaX genes of several
bacteria. Two new primers were designed for circular PCR based on this sequence.

A circular PCR product of approximately 1.6 kb was obtained from a 
HincII digest of chromosomal DNA that was recircularized with ligase. This first 
circular PCR yielded most of the remaining dnaX gene. The two primers were as 
follows: 

Rightward (SEQ. ID. No. 53)

tttgtaaagg cattacgcag gggactaatt cagatgtg 38



Leftward (SEQ. ID. No. 54) 

tatgacattc attacaaggt tctccatcag tgc

Genomic DNA (3 µg) was digested with HincII, purified by phenol/chloroform
extraction, ethanol precipitated, and redissolved in 70 µl T.E. buffer. The genomic
DNA was recircularized upon adding 4000 units T4 ligase (New England Biolabs) in
a final volume of 100 µl T4 ligase buffer (New England Biolabs) at 16°C overnight.
The PCR reaction consisted of 90 ng recircularized genomic DNA, 0.5 mM each
dNTP, 100 pmol of each primer, 1.4 mM magnesium sulfate, and 1 unit of elongase
(GIBCO) in a final volume of 100 µl elongase buffer (GIBCO). Forty cycles were
performed using the following scheme: 94°C, 1 min; 55°C, 1 min; and 68°C, 2 min.
The resulting PCR product was approximately 1.6 kb. The PCR product was purified
from a 0.7% agarose gel and sequenced directly. A stretch of approximately 750
nucleotides was obtained using the rightward primer used in the circular PCR
reaction. To obtain the rest of the sequence, other sequencing primers were designed
in succession based on the information of each new sequencing run.

This sequence, when spliced together with the previous 300 bp PCR
sequence, contained the complete N-terminus of the gene product (stop codons are
present upstream) and possibly lacked only about 50 residues of the C-terminus. The
amino-terminal region of E. coli tau appears to be the most conserved region of the
gene, as this area shares homology with the RFC subunit of the human clamp
loader and with the gene 44 protein of the phage T4 clamp loader. An alignment of
the N-terminal region of the S. aureus tau protein with that of B. subtilis and E. coli is
shown in Figure 10. Among the highly conserved residues are the ATP binding site
consensus sequence and the four cysteine residues that form a Zn2+ finger.

After obtaining 1 kb of sequence in the 5' region of dnaX, the
remaining 3' end of the gene was sought. Circular PCR products of
approximately 800 bp, 600 bp, and 1600 bp were obtained from Apo I, Nsi I, or
Ssp I digests of chromosomal DNA that were recircularized with ligase.

Rightward (SEQ. ID. No. 55)

gagcactgat gaacttagaa ttagatatg 29



Leftward (SEQ. ID. No. 56)

gatactcagt atctttctca gatgttttat tc

Genomic DNA (3 µg) was digested with Apo I, Nsi I, or Ssp I, purified by
phenol/chloroform extraction, ethanol precipitated, and redissolved in 70 µl T.E. buffer.
The genomic DNA was recircularized upon adding 4000 units of T4 ligase (New
England Biolabs) in a final volume of 100 µl T4 ligase buffer (New England Biolabs) at
16°C overnight. The PCR reaction consisted of 90 ng recircularized genomic DNA,
0.5 mM each dNTP, 100 pmol of each primer, 1.4 mM magnesium sulfate, and 1 unit
of elongase (GIBCO) in a final volume of 100 µl elongase buffer (GIBCO). Forty cycles
were performed using the following scheme: 94°C, 1 min; 55°C, 1 min; 68°C, 2 min.
The PCR products were directly cloned into pCR II TOPO vector using the TOPO TA
cloning kit (Invitrogen Corporation) to obtain the rest of the C-terminal sequence
of S. aureus dnaX. DNA sequencing was performed by the Rockefeller University
sequencing facility.



Example 16 - Identification and Cloning of S. aureus dnaB



In E. coli, the DnaB helicase assembles with the DNA polymerase III
holoenzyme to form a replisome assembly. The DnaB helicase also interacts directly
with the primase to complete the machinery needed to duplicate a double helix. As a
first step in studying how the S. aureus helicase acts with the replicase and primase, S.
aureus was examined for the presence of a dnaB gene.

The amino acid sequences of the DnaB helicase of Escherichia coli,
Salmonella typhimurium, Haemophilus influenzae, and Helicobacter pylori were
aligned using Clustal W (1.5). Two regions about 200 residues apart showed good
homology. These peptide sequences were:



Upstream, corresponding to residues 225-238 of E. coli DnaB (SEQ. ID. No. 57)

Asp Leu Ile Ile Val Ala Ala Arg Pro Ser Met Gly Lys Thr
1               5                   10

Downstream, corresponding to residues 435-449 of E. coli DnaB (SEQ. ID. No. 58)

Glu Ile Ile Ile Gly Lys Gln Arg Asn Gly Pro Ile Gly Thr Val
1               5                   10                  15

The following primers were designed from regions which contained conserved
sequences using codon preferences for S. aureus:

Upstream (SEQ. ID. No. 59)

gaccttataa ttgtagctgc acgtccttct atgggaaaaa c 41

Downstream (SEQ. ID. No. 60)

aacattatta agtcagcatc ttgttctatt gatccagatt caacgaag 48

A PCR reaction was carried out using 2.5 units of Taq DNA Polymerase (Gibco,
BRL) with 100 ng S. aureus genomic DNA as template, 1 mM dNTPs, 1 µM of each
primer, and 3 mM MgCl2 in 100 µl of Taq buffer. Thirty-five cycles of the following
scheme were repeated: 94°C, 1 min; 55°C, 1 min; and 72°C, 1 min. Two PCR
products were produced; one was about 1.1 kb, and the other was 0.6 kb. The smaller
one was the size expected. The 0.6 kb product was gel purified and used as a template
for a second round of PCR as follows. The 0.6 kb PCR product was purified from a
0.8% agarose gel using a Geneclean III kit (Bio 101) and then divided equally into
five separate aliquots, as a template for PCR reactions. The final PCR product was
purified using a Qiagen QIAquick PCR Purification kit, quantitated via optical
density at 260 nm, and sequenced by the Protein/DNA Technology Center at
Rockefeller University. The same primers used for PCR were used to prime the
sequencing reaction. The amino acid sequence was determined by translation of the
DNA sequence in all three reading frames, and selecting the longest open reading
frame. The PCR product contained an open reading frame over its entire length. The
predicted amino acid sequence shares homology with the amino acid sequences encoded
by the dnaB genes of other organisms.
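
The reading-frame selection described above can be illustrated with a short script. This is a minimal sketch in Python under stated assumptions, not the procedure actually used: it assumes the standard genetic code, examines only the three forward frames, and treats any stop-free stretch as an open reading frame. The function names `translate` and `longest_orf` are hypothetical. The upstream primer (SEQ. ID. No. 59) serves as test input because its frame-0 translation should reproduce the first 13 residues of SEQ. ID. No. 57.

```python
from itertools import product

# Standard genetic code, listed in the conventional T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna, frame):
    """Translate one forward reading frame (0, 1 or 2); '*' marks a stop codon."""
    dna = dna.upper()
    codons = (dna[i:i + 3] for i in range(frame, len(dna) - 2, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


def longest_orf(dna):
    """Return (frame, peptide) for the longest stop-free stretch in frames 0-2.

    Ties are resolved in favour of the lowest-numbered frame.
    """
    best = (0, "")
    for frame in range(3):
        for segment in translate(dna, frame).split("*"):
            if len(segment) > len(best[1]):
                best = (frame, segment)
    return best


# Upstream primer of SEQ. ID. No. 59, spaces removed.
PRIMER = "gaccttataa ttgtagctgc acgtccttct atgggaaaaa c".replace(" ", "")

print(translate(PRIMER, 0))  # DLIIVAARPSMGK
print(longest_orf(PRIMER))
```

On a sequence as short as a primer more than one frame can be free of stop codons, so the frame choice only becomes informative on fragments of several hundred bases such as the 0.6 kb product.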

Additional sequence information was determined using the circular
PCR technique. Briefly, S. aureus genomic DNA was digested with various
endonucleases, then religated with T4 DNA ligase to form circular templates. To
perform PCR, two primers were designed from the initial sequence.

First primer (SEQ. ID. No. 61)

gatttgtagt tctggtaatg ttgactcaaa ccgcttaaga accgg 45



Second primer (SEQ. ID. No. 62)

atacgtgtgg ttaactgatc agcaacccat ctctagtgag aaaatacc 48

The first primer matches the sequence of the coding strand and the second primer
matches the sequence of the complementary strand. These two primers are directed
outwards from a central point, and allow determination of new sequence information
up to the ligated endonuclease site. A PCR product of approximately 900 bases in
length was produced using the above primers and template derived from the ligation
of S. aureus genomic DNA which had been cut with the restriction endonuclease Apo
I. This PCR product was electrophoresed in a 0.8% agarose gel, eluted with a Qiagen
gel elution kit, divided into five separate aliquots, and used as a template for
reamplification by PCR using the same primers as described above. The final product
was electrophoresed in a 0.8% agarose gel, visualized via staining with ethidium
bromide under ultraviolet light, and excised from the gel. The excised gel slice was
frozen, and centrifuged at 12,000 rpm for 15 minutes. The supernatant was extracted
with phenol/chloroform to remove ethidium bromide, and was then cleaned using a
Qiagen PCR purification kit. The material was then quantitated from its optical
density at 260 nm and sequenced by the Protein/DNA Technology Center at the
Rockefeller University.
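
The geometry of the circular PCR step can be sketched in silico. The following Python example is illustrative only: the toy genome, the flanking sequences, the 8-nucleotide primers and the function names are all hypothetical, and the site gaattc with a cut after its first base is an assumption chosen because it is one sequence matched by the Apo I consensus. It shows why outward-facing primers on a religated fragment yield a product that runs from the known region, across the ligated restriction site, and back into the known region from the other side.

```python
def revcomp(seq):
    """Reverse complement of a lowercase DNA string."""
    return seq.translate(str.maketrans("acgt", "tgca"))[::-1]


def digest(genome, site, cut_offset):
    """Cut a linear sequence at each occurrence of site, cut_offset bases into it."""
    pieces, start = [], 0
    pos = genome.find(site)
    while pos != -1:
        pieces.append(genome[start:pos + cut_offset])
        start = pos + cut_offset
        pos = genome.find(site, pos + 1)
    pieces.append(genome[start:])
    return pieces


def inverse_pcr(fragment, forward, reverse):
    """Product from a self-ligated fragment amplified with outward-facing primers.

    forward matches the top (coding) strand; reverse matches the complementary
    strand, so its reverse complement is what appears on the top strand.
    """
    start = fragment.find(forward)
    target = revcomp(reverse)
    stop = fragment.find(target)
    if start == -1 or stop == -1:
        return None
    end = stop + len(target)
    if stop < start:
        # Outward-facing primers: extension wraps around the ligation junction.
        end += len(fragment)
    return (fragment + fragment)[start:end]


# Hypothetical toy sequences (not from S. aureus).
KNOWN = "atgcgtaaaccgggtctgac"          # region whose sequence is already known
LEFT, RIGHT = "ccgtagg", "tcatgca"      # unknown flanks to be recovered
GENOME = "ttac" + "gaattc" + LEFT + KNOWN + RIGHT + "gaattc" + "aggt"

fragments = digest(GENOME, "gaattc", 1)
circle = fragments[1]                   # the fragment carrying the known region
forward = KNOWN[-8:]                    # points rightward, out of the known region
reverse = revcomp(KNOWN[:8])            # points leftward, out of the known region

product = inverse_pcr(circle, forward, reverse)
print(product)
print(product.find("gaattc"))           # position of the regenerated site
```

The regenerated site inside the product marks the boundary between the two flanks, which corresponds to the point at which the reading frame is expected to change in the sequenced product.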

The nucleotide sequence contained an open reading frame over its
length, up to a sequence which corresponded to the consensus sequence of a cleavage
site of the enzyme Apo I. Following this point, a second open reading frame, in
a different reading frame, continued up to the end of the product. The first open
reading frame was found to match the initial sequence and to extend it yet further
towards the C-terminus of the protein. The second reading frame was found to end in
a sequence which matched the 5'-terminus of the previously determined sequence and,
thus, represents an extension of the sequence towards the N-terminus of the protein.

Additional sequence information was obtained using the above primers
and a template generated using S. aureus genomic DNA circularized via ligation with
T4 ligase following digestion with Cla I. The PCR product was generated using 35
cycles of the following program: denaturation at 94°C for 1 min; annealing at 55°C
for 1 min; and extension at 68°C for 3 min and 30 s. The PCR products were
electrophoresed in a 0.8% agarose gel, eluted with a Qiagen gel elution kit, divided
into five separate aliquots, and used as a template for reamplification via PCR with the
same primers described above. The final product was electrophoresed in a 0.8%
agarose gel, visualized via staining with ethidium bromide under ultraviolet light, and
excised from the gel. The excised gel slice was frozen, and centrifuged at 12,000 rpm
for 15 min. The supernatant was cleaned using a Qiagen PCR purification kit. The
material was then quantitated via optical density at 260 nm and sequenced by the
Protein/DNA Technology Center at Rockefeller University. The open reading frames
continued past 500 bases. Therefore, the following additional sequencing primers
were designed from the sequence to obtain further information:

First primer (SEQ. ID. No. 63)

cgttttaatg catgcttaga aacgatatca g 31

Second primer (SEQ. ID. No. 64)

cattgctaag caacgttacg gtccaacagg c 31

The N-terminal and C-terminal nucleotide sequence extensions
generated using this circular PCR product completed the 5' region of the gene
(encoding the N-terminus of DnaB); however, a stop codon was not reached in the 3'
region and, thus, a small amount of sequence is still needed to complete this gene.

The alignment of the S. aureus dnaB with E. coli dnaB and the dnaB
genes of B. subtilis and S. typhimurium is shown in Figure 11.

Example 17 - Identification and Cloning of S. aureus holB 

The S. aureus holB was identified by searching the S. aureus database
with the sequences of S. pyogenes δ' subunit. The S. aureus holB encodes a 253
residue protein of about 28 kDa. The holB gene was amplified by PCR using an
upstream 69-mer primer as follows:

Upstream Primer (SEQ. ID. No. 65):

ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatac ccatg 60
gatgaacag 69

which contains an NcoI site (underlined), and a downstream 39-mer primer as
follows:

Downstream Primer (SEQ. ID. No. 66):

aattttaaag gatcc gtgta taatattcta attttcccg 39

which contains a BamHI site (underlined). The PCR product was digested with NcoI
and BamHI, purified, and ligated into the NcoI and BamHI sites of pET11a to produce
plasmid pETSaholB.

Example 18 - Purification of S. aureus δ'

The pETSaholB plasmid of Example 17 was transformed into E.
coli BL21(DE3)recA. A single colony was used to inoculate 2L of LB media
supplemented with 200 µg/ml ampicillin. Cells (2L) were grown at 37°C to
OD600=0.5 at which point the temperature was lowered to 15°C and 0.5 mM IPTG
was added. After 16 hr of induction, cells were collected by centrifugation and
resuspended in 50 mM Tris-HCl (pH 7.5), 10% sucrose, 1 M NaCl, 30 mM
spermidine, 5 mM DTT, and 2 mM EDTA. Cells were lysed by two passages through
a French press (15,000 psi), followed by centrifugation at 13,000 rpm for 30 min at
4°C. Ammonium sulfate (0.3 g/ml) was added to the clarified lysate. The pellet was
backwashed in 30 ml buffer A containing 0.1 M NaCl and 0.24 g/ml ammonium
sulfate using a Dounce homogenizer, then the pellet was recovered by centrifugation.
The resulting pellet was resuspended in 20 ml of buffer A and dialyzed against buffer
A. The dialyzed protein was applied to a 20 ml FFQ Sepharose column equilibrated
in buffer A and eluted with a 200 ml linear gradient of 0 - 500 mM NaCl in buffer A;
80 fractions were collected. Peak fractions (54 - 75) were combined (72 mg) and
dialyzed against buffer A. The δ' preparation was aliquoted and stored frozen at
-80°C.

Example 19 - Identification and Cloning of S. aureus holA

The S. aureus holA gene was identified by searching the S. aureus
database with the sequences of E. coli and S. pyogenes δ subunits. The S. aureus holA
gene encodes a 288 residue protein of about 32 kDa. The holA gene was amplified by
PCR using an upstream 28-mer primer as follows:

Upstream Primer (SEQ. ID. No. 67):

gggagtttgt aat ccatgg a tgaacagc 28

which contains an NcoI site (underlined), and a downstream 37-mer primer as follows:

Downstream Primer (SEQ. ID. No. 68):

ctgaacacct attac cctag gcatctaact cacaccc 37

which contains a BamHI site (underlined). The PCR product was digested with NcoI
and BamHI, purified, and ligated into the NcoI and BamHI sites of pET11a to produce
plasmid pETSaholA.

Example 20 - Purification of S. aureus δ

The pETSaholA plasmid of Example 19 was transformed into E. coli
NovaBlue (recA1 lac [F' proA+B+ lacIqZΔM15::Tn10(TcR)]) (Novagen). A single
colony was used to inoculate 12L of LB media supplemented with 200 µg/ml
ampicillin. Cells (12L) were grown at 37°C to OD600=0.5 at which point the
temperature was lowered to 15°C and 0.5 mM IPTG was added. After 16 hr of
induction, cells were collected by centrifugation and resuspended in 50 mM Tris-HCl (pH
7.5), 10% sucrose, 1 M NaCl, 30 mM spermidine, 5 mM DTT, and 2 mM EDTA.
Cells were lysed by two passages through a French press (15,000 psi), followed by
centrifugation at 13,000 rpm for 30 min at 4°C. Ammonium sulfate (0.3 g/ml) was
added to the clarified lysate. The resulting pellet was resuspended in 250 ml of buffer
A. The dialyzed protein was applied to a 100 ml FFQ Sepharose column equilibrated
in buffer A and eluted with a 1000 ml linear gradient of 0 - 500 mM NaCl in buffer A;
80 fractions were collected. Peak fractions (40-49) were combined (65 mg) and
dialyzed against buffer A. The dialyzed protein was applied to an 8 ml MonoQ
Sepharose column equilibrated in buffer A and eluted with an 80 ml linear gradient of 0
- 500 mM NaCl in buffer A; 80 fractions were collected. Peak fractions of the δ
preparation were stored frozen at -80°C.

Example 21 - Constitution of a Processive S. aureus DNA Polymerase III Enzyme
from Three Components

The PolC (alpha-large) requires the β clamp for processivity, which in
turn requires the clamp loader (τδδ') for assembly onto DNA. The S. aureus clamp
loader, the τδδ' complex, was assembled by mixing the three proteins as follows: 400 µg
of τ and 80 µg each of δ and δ' were mixed in buffer A containing no NaCl and
preincubated at 15°C for 10 min. The mixture was injected onto a 1 ml MonoQ
column equilibrated in buffer A, and then eluted with a 30 ml linear gradient of 0-500
mM NaCl in buffer A; 60 fractions were collected. Fractions were analyzed in a 10%
SDS-polyacrylamide gel stained with Coomassie Blue. Peak fractions (40-50) were
combined and concentrated using a Centricon 30 concentrator.

The ability of the three components to work together to form the
processive Pol III was tested by determining whether τδδ' and the β clamp could confer
on PolC the ability to completely extend a single primer full circle around a large 7.2
kb circular M13mp18 ssDNA genome. Replication reactions contained 70 ng (25
fmol) of singly primed M13mp18 ssDNA, 20 ng S. aureus β, 50 ng S. aureus PolC,
either 30 ng or 90 ng of S. aureus τδδ' (when indicated), and 0.82 ng of S. pyogenes
SSB in 24 µl of 20 mM Tris-HCl (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 mM DTT,
2 mM ATP, 8 mM MgCl2, 40 µg/ml BSA, and 60 mM each of dGTP and dCTP.
Reactions were pre-incubated for 2 min at 37°C to assemble protein complexes on the
primer terminus. DNA synthesis was initiated upon addition of 1.5 µl dATP and 32P-
TTP (specific activity 2,000-4,000 cpm/pmol) and synthesis was allowed to proceed
for 1 min before being quenched with an equal volume (25 µl) of a solution of 1%
SDS and 40 mM EDTA. One-half of the quenched reaction was analyzed for total
DNA synthesis using DE81 paper as described, and the other half was analyzed by
agarose gel electrophoresis. An autoradiogram of the agarose gel analysis of the replication
products is depicted in Figure 13, which shows that the presence of PolC and β, but
absence of τδδ' (lane 1), gives no full length circular duplex (RFII). However, in the
presence of τδδ' (lanes 2 and 3), full length circular duplex DNA (RFII) is produced,
as expected for the action of a processive Pol III holoenzyme.

Example 22 - General Induction/Purification Conditions for S. pyogenes 

The purification protocols for S. pyogenes proteins were performed
using the following standardized conditions. Cells were grown from a single colony,
freshly transformed overnight. Cells were grown in 200 µg/ml Ampicillin to
OD600=0.3-0.4, at which point cultures were chilled prior to addition of IPTG (to a
final concentration of 0.5 mM) and were allowed to incubate for 16 hrs at 15°C.
Following this, all procedures were performed at 4°C. Cell paste (1-2 g/liter of
culture) was resuspended (10 ml/g cell paste) in 50 mM Tris-HCl (pH 7.5)/10%
Sucrose/1 M NaCl/5 mM DTT/30 mM Spermidine/1X Heat lysis buffer (50 mM
Tris-HCl (pH 7.5), 1% Sucrose, 100 mM NaCl, 2 mM EDTA). Cells were lysed by
two passages through the French Press (15,000 psi) followed by centrifugation at
14,000 rpm at 4°C. Ammonium sulfate, when added to the cleared lysate, was added
gradually. Precipitate was allowed to settle on ice for a minimum of 30 min prior to
collection by centrifugation. Protein pellets were resuspended in buffer A (50 mM
Tris-HCl pH 7.5, 1 mM EDTA, 5 mM DTT, 10% glycerol) and dialyzed for over 3
hours in the same buffer. Column design is based on the manufacturer's suggested
capacities: Fast Flow Q (FFQ) and MonoQ are 20 mg protein/ml resin, Heparin-
Affigel agarose is 1.2 mg protein/ml resin. Elution was performed using 10 column
volume (c.v.) gradients, and the entire gradient elution profile was collected in 80
fractions. Unless mentioned otherwise, all columns were equilibrated and eluted with
buffer A.
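
The column design rule stated above reduces to simple arithmetic, which the following Python sketch illustrates. It is offered as an illustration only; the function names are hypothetical, the capacities are those quoted in this Example, and the 500 mg load and 35 ml bed used in the demonstration are taken from Example 23.

```python
# Capacities in mg protein per ml resin, as quoted in Example 22.
CAPACITY_MG_PER_ML = {"FFQ": 20.0, "MonoQ": 20.0, "Heparin-Affigel": 1.2}


def minimum_bed_volume_ml(resin, protein_mg):
    """Smallest bed volume that keeps the load within the quoted capacity."""
    return protein_mg / CAPACITY_MG_PER_ML[resin]


def gradient_plan(bed_ml, column_volumes=10, fractions=80):
    """Return (total gradient volume in ml, volume per fraction in ml)."""
    gradient_ml = bed_ml * column_volumes
    return gradient_ml, gradient_ml / fractions


# 500 mg of dialyzed protein loaded on FFQ, as in Example 23.
print(minimum_bed_volume_ml("FFQ", 500))  # 25.0
# A 35 ml bed eluted with a 10 c.v. gradient collected in 80 fractions.
print(gradient_plan(35))                  # (350, 4.375)
```

A bed somewhat larger than the computed minimum leaves a margin below the quoted capacity.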

Example 23 - Identification of a S. pyogenes holA gene Encoding a Functional 
Delta Subunit and Purification of the Delta Subunit 

Alignment of E. coli delta subunit with 10 other putative holA products
from unfinished genome databases of Gram negative bacteria indicates a region of
conserved amino acid sequence. Amino acids Q140 to L230 of E. coli delta were
used to search the B. subtilis genome database for a Gram positive delta homolog.
This search revealed yqeN, a potential reading frame of unknown function, as the
highest scoring sequence. Although the score was low, it was treated as a candidate
for Gram positive delta. The alignment with E. coli delta is shown in Figure 12A. A
Streptococcus pyogenes genome database was searched with yqeN. Two contigs
which represent N- (contig 206) and C- (contig 264) termini of S. pyogenes delta
subunit were identified. The alignment of the putative S. pyogenes holA with B.
subtilis yqeN is shown in Figure 12B. The following primers were used to obtain
PCR products for delta subunit:

holA Upstream (SEQ. ID. No. 69)

ggagcagatt gcttttgata catatgattg gcctattc 38

holA Downstream (SEQ. ID. No. 70)

ttgtctccgc atcaaactgg gatccaagag catcatacgc gtatgg 46

These primers were used to amplify the holA gene from S. pyogenes genomic DNA.
The PCR product was digested with NdeI and BamHI, purified and ligated into the
pET11a vector to produce pET11a.S.p.holA.

The pET11a.S.p.holA plasmid was transformed into the
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation
was used to inoculate 12L LB broth supplemented with 200 µg/ml Ampicillin. Cells
were grown at 37°C to OD600=0.5, at which point the temperature was lowered to
15°C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning,
cells were collected by centrifugation and resuspended in 50 mM Tris-HCl (pH 7.5)/
10% Sucrose/1X Heat Lysis Buffer/1 M NaCl/30 mM Spermidine/5 mM DTT. Cells
were lysed by two passages through the French press (15,000 psi), followed by
centrifugation at 13,000 rpm for 30 min. The supernatant was decanted and
ammonium sulfate was added to a final concentration of 0.226 g/ml. The resulting
pellet was collected by centrifugation and resuspended in 20 ml of buffer A. The
resuspended pellet was dialyzed against buffer A containing no salt. The dialyzed
protein (500 mg) was loaded onto a FFQ-Sepharose (35 ml) column and eluted with a
linear gradient from 0 - 500 mM NaCl (10 c.v.). The peak fractions (21-45) were
combined and dialyzed against buffer A (0 NaCl) for 3 hrs, then diluted to a
conductivity of 50 mM NaCl and loaded (160 mg) onto a 120 ml Heparin-Affigel
column. Protein was eluted with a linear gradient of 0-500 mM NaCl (10 c.v.). The
fractions containing the least contaminants (39-51) were precipitated with ammonium
sulfate (0.226 g), collected by centrifugation, resuspended in 5 ml of buffer A, and
dialyzed in buffer A containing 200 mM NaCl. The delta subunit was stored at
-80°C. The final delta subunit preparation is shown in the lane marked δ of the
Coomassie Blue stained SDS-polyacrylamide gel of Figure 14. Yield = 65 mg.

Example 24 - Identification of S. pyogenes holB Encoding Delta Prime and 
Purification of the Delta Prime Subunit 

A search of the S. pyogenes genome database with the predicted B.
subtilis delta prime amino acid sequence revealed a DNA sequence in contig #209
(previously known as contig #210) that predicted a high scoring match for a gene
encoding a delta prime protein. The following primers were used to obtain PCR
products for holB:

holB Upstream (SEQ. ID. No. 71)

gcctaggata agggagggta catatggatt tagcgc 36

holB Downstream (SEQ. ID. No. 72)

cgggcaagtc ttttgacaag cttcggatcc ccataacgaa ttcc 44

The PCR product obtained from these primers was digested with NdeI and BamHI,
purified and ligated into the pET11a vector to produce pET11a.S.p.holB.

The pET11a.S.p.holB plasmid was transformed into the
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation
was used to inoculate 12L LB broth supplemented with 200 µg/ml Ampicillin. Cells
were grown at 37°C to O.D.600=0.4, at which point the temperature was lowered to
15°C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning,
cells were collected by centrifugation and resuspended in 100 ml 50 mM Tris-HCl
(pH 7.5)/10% Sucrose/1X Heat Lysis Buffer. Lysis was initiated upon addition of
0.4 mg/ml lysozyme followed by a 1 hr incubation on ice. Lysate was clarified by
centrifugation at 13,000 rpm for 30 min. Ammonium sulfate was added to the
supernatant to a final concentration of 0.3 g/ml. The protein pellet was resuspended in
buffer A (0.1 M NaCl) + 0.24 g/ml ammonium sulfate and clarified by centrifugation.
The resulting protein pellet was resuspended in 20 ml of buffer A and dialyzed against
buffer A. The dialyzed protein (450 mg) was loaded onto a 30 ml FFQ-Sepharose
column and eluted with a linear gradient from 0 - 500 mM NaCl. The peak fractions
were combined (fr# 20-30 containing 130 mg), dialyzed against buffer A, and
loaded (70 mg) onto a 50 ml Heparin-Affigel column. Protein was eluted with a
linear gradient of 0-500 mM NaCl. Delta prime binds weakly to both resins and elutes
in the beginning of the gradient. This delta prime subunit was stored frozen at -80°C.
The final delta prime subunit preparation is shown in the lane marked δ' of the Coomassie
Blue stained SDS-polyacrylamide gel of Figure 14. Yield = 40 mg.

Example 25 - Identification of the S. pyogenes dnaX Gene and Purification of the 
Tau Subunit 

A search of the S. pyogenes genome database with the putative B.
subtilis tau amino acid sequence revealed a DNA sequence in contig #284 (previously
known as contig #289) with a high scoring match which predicted a gene encoding
a tau subunit protein. A set of PCR primers to the 5'- and 3'-termini of the putative
gene sequence was designed to include restriction enzyme recognition sequences for
NdeI and BamHI, respectively. These primers are:

dnaX Upstream (SEQ. ID. No. 73)

ggagttaaaa acatatgtat caagctcttt atc 33

dnaX Downstream (SEQ. ID. No. 74)

cgtgggtaag ggcaaaacgg atcccttatg tatttcag 38

A PCR product obtained with the above primers was digested with NdeI and BamHI,
purified and ligated into pET11a vector to produce pET11a.S.p.dnaX.

The pET11a.S.p.dnaX plasmid was transformed into the
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation
was used to inoculate 24L LB broth supplemented with 200 µg/ml Ampicillin. Cells
were grown at 37°C to O.D.600=0.5, at which point the temperature was lowered to
15°C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning,
cells were collected by centrifugation and resuspended in 200 ml of 50 mM Tris-HCl
(pH 7.5)/10% Sucrose/1X Heat Lysis Buffer/1 M NaCl/30 mM Spermidine/5 mM
DTT/5 mM EDTA. Cells were lysed by two passages through the French press
(15,000 psi), followed by centrifugation at 13,000 rpm for 30 min. The supernatant
(2.4 gm) was dialyzed against buffer A containing 50 mM NaCl, loaded onto a 120 ml
FFQ column (without ammonium sulfate precipitation) and eluted with a linear
gradient of 100-700 mM NaCl. The peak fractions (fr# 41-55) were combined,
diluted with buffer A containing no salt (a dilution of 1/5) to a conductivity of 100
mM NaCl, loaded (310 mg) onto a 300 ml Heparin-Affigel column, and eluted with a
linear gradient of 100-500 mM NaCl. The peak fractions (fr# 21-36) were combined,
dialyzed against buffer A, loaded (87 mg) onto a 10 ml FFQ column, and eluted as
described for the first FFQ column. The peak fractions (fr# 27-41) were concentrated
by centrifugation in a Centriprep 30 filtration unit and frozen at -80°C. The final tau
subunit preparation is shown in the lane marked τ of the Coomassie Blue stained SDS-
polyacrylamide gel of Figure 14. Yield = 103 mg.
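
The dilution step used before loading the Heparin-Affigel column follows the usual mass balance for mixing two solutions. The Python sketch below is illustrative only: the 500 mM starting salt concentration and the 10 ml pool volume are hypothetical values chosen so that a 1/5 dilution with salt-free buffer A gives the 100 mM NaCl stated above, and the function names are not from the original.

```python
def diluted_concentration(stock_mM, stock_ml, diluent_ml, diluent_mM=0.0):
    """Salt concentration after mixing a pool with diluent (simple mass balance)."""
    total_ml = stock_ml + diluent_ml
    return (stock_mM * stock_ml + diluent_mM * diluent_ml) / total_ml


def diluent_needed(stock_mM, stock_ml, target_mM):
    """Volume of salt-free buffer needed to bring a pool down to target_mM."""
    return stock_ml * (stock_mM / target_mM - 1)


# Hypothetical pool: 10 ml at 500 mM NaCl, diluted 1/5 with salt-free buffer A.
print(diluted_concentration(500.0, 10.0, 40.0))  # 100.0
print(diluent_needed(500.0, 10.0, 100.0))        # 40.0
```

In practice the salt concentration of pooled fractions is estimated from conductivity rather than assumed, so the starting value would be measured before computing the diluent volume.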



Example 26 - Identification of the S. pyogenes dnaN Gene and Purification of the 
Beta Subunit 

A search of the S. pyogenes genome database with the putative B.
subtilis beta subunit amino acid sequence revealed a DNA sequence (contig #266)
with a high scoring match which predicted a gene encoding a beta subunit protein.
A set of PCR primers to the 5'- and 3'-termini of the putative gene sequence was
designed to include restriction enzyme recognition sequences for NdeI and BamHI,
respectively. The primers were:

dnaN Upstream (SEQ. ID. No. 75)

ggagttcata tgattcaatt ttcaaattaa tcgc 34

dnaN Downstream (SEQ. ID. No. 76)

tatcagctcc tggatccagt accttccatt gattagcc 38

A PCR product obtained with these primers was digested with NdeI and BamHI,
purified and ligated into pET16b vector to produce pET16b.S.p.dnaN.

The pET16b.S.p.dnaN plasmid was transformed into the
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation
was used to inoculate 15L LB broth supplemented with 200 µg/ml Ampicillin. Cells
were grown at 37°C to O.D.600=0.4, at which point the temperature was lowered to
15°C and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning,
cells were collected by centrifugation and resuspended in 100 ml 50 mM Tris-HCl
(pH 7.5)/10% Sucrose/1X Heat Lysis Buffer/1 M NaCl/5 mM DTT/30 mM
Spermidine/5 mM EDTA. Cells were lysed by two passages through the French press
(15,000 psi), followed by centrifugation at 13,000 rpm for 30 min. Ammonium
sulfate was added to the supernatant to a final concentration of 0.3 g/ml. The resulting
protein pellet was resuspended and dialyzed against buffer A containing 50 mM NaCl.
The dialyzed protein (300 mg) was loaded onto a 45 ml FFQ-Sepharose column and
eluted with a linear gradient from 50 - 500 mM NaCl. The peak fractions (16-30) were
combined, dialyzed against buffer A containing 50 mM NaCl, loaded onto a 25 ml
EAH-Sepharose column, and eluted with a linear gradient of 50-500 mM NaCl. The
fractions containing the least contaminants were combined into two pools (pool I,
fractions 10-17; pool II, fractions 19-27). Each pool was further purified on an 8 ml
MonoQ column (performed under conditions described for the FFQ column above).
The final beta subunit preparation is shown in the lane marked β of the Coomassie
Blue stained SDS-polyacrylamide gel of Figure 14. Yield = 48 mg.



Example 27 - Identification of the S. pyogenes polC Gene and Purification of the 
Alpha-Large Polymerase Subunit 

A search of the B. subtilis genome database with the E. coli alpha
subunit amino acid sequence revealed two DNA sequences with a high scoring match
which predicted two genes encoding alpha-like polymerase subunits. The DNA
sequence with the second highest scoring match, which encoded the larger of the two
polymerase subunits, also appeared to encode the epsilon exonuclease domain at
the N-terminus of the putative alpha subunit. A search of the B. subtilis genome
database with S. pyogenes DNA sequence confirmed this nucleotide sequence to
encode the Gram positive homolog of the E. coli replicative polymerase subunit
(alpha). This Gram negative alpha-like subunit lacked homology to epsilon. The
gene encoding the large alpha polypeptide sequence (alpha-large) will be referred to as
the product of the polC gene and the gene encoding the smaller Gram-negative alpha-
like polymerase (alpha-small) will be referred to as the product of the polE or dnaE
gene (see Example 28).

The alpha-large polymerase polypeptide is a product of two
overlapping contigs; contig #197 (renamed #193) encodes the N-terminal 630 amino
acids, and contig #278 (renamed #273) encodes the C-terminal 1392 amino acids. The
putative Open Reading Frame generates a 1464 amino acid polypeptide (SEQ. ID.
No. 18). Since the polC nucleotide sequence contained several NdeI sites, a primer
was designed to mutate two restriction endonuclease sites in the pET11a nucleotide
sequence upstream of the N-terminus of the gene; an XbaI restriction site was mutated
to an NheI restriction site and an NdeI restriction site at the starting ATG was
removed. The 74mer primer spans from the mutated XbaI site upstream of the T7
promoter and includes the NheI site, the rbs (ribosome binding site), the mutated
NdeI site and the first 10 amino acid codons of the polC gene sequence. The following
primers were used in a PCR reaction to amplify the polC gene from S. pyogenes
genomic DNA:

polC Upstream (SEQ. ID. No. 77)

ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatacccatg 60 
tcagatttat tcgc 74 

polC Downstream (SEQ. ID. No. 78)

cggtgtctct atctaaatga ctcatttggg atcctcgctt tatacggtat gtcacag 57 
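
The design features described for these primers (length, presence of the NheI and BamHI sites, and absence of intact XbaI and NdeI sites in the upstream primer) can be checked mechanically. The following is a minimal sketch only: the primer strings are copied from the listing above, the recognition sequences are the standard ones for each enzyme, and the helper names (site_positions, summarize) are hypothetical and not part of the original disclosure.

```python
# Primer sequences copied from the listing (SEQ. ID. Nos. 77 and 78), spaces removed.
POLC_UPSTREAM = (
    "ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatacccatg"
    " tcagatttat tcgc"
).replace(" ", "")
POLC_DOWNSTREAM = (
    "cggtgtctct atctaaatga ctcatttggg atcctcgctt tatacggtat gtcacag"
).replace(" ", "")

# Standard recognition sequences, written in lowercase to match the listing.
SITES = {"NheI": "gctagc", "XbaI": "tctaga", "NdeI": "catatg", "BamHI": "ggatcc"}


def site_positions(seq: str, site: str) -> list[int]:
    """Return every 0-based start index of `site` in `seq` (overlaps allowed)."""
    hits = []
    start = seq.find(site)
    while start != -1:
        hits.append(start)
        start = seq.find(site, start + 1)
    return hits


def summarize(name: str, seq: str) -> dict:
    """Collect the primer length and the positions of each restriction site."""
    found = {enzyme: site_positions(seq, site) for enzyme, site in SITES.items()}
    return {"name": name, "length": len(seq), "sites": found}


if __name__ == "__main__":
    for name, seq in (("polC Upstream", POLC_UPSTREAM),
                      ("polC Downstream", POLC_DOWNSTREAM)):
        print(summarize(name, seq))
```

Under these assumptions the upstream primer is 74 nucleotides long with a single NheI site and no XbaI or NdeI site, and the downstream primer is 57 nucleotides long with a single BamHI site, which is consistent with the cloning strategy described in the text.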

Elongase (BRL) produced the best amplification results. PCR reaction conditions
were: 5 µg genomic DNA, 20 ng of each primer, 1 µl Elongase, 60 µM each dNTP,
in 100 µl Elongase reaction buffer for 1 min at 94°C, 1 min at 55°C, and 6 min at
60°C repeated for 40 cycles. The resulting 4000 bp PCR fragment was digested with
NheI and BamHI, purified and ligated into the pET11a vector (digested with XbaI and
BamHI) to produce pET11a.S.p.polC.
The pET11a.S.p.polC plasmid was transformed into the
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation
was used to inoculate 24 L of LB broth supplemented with 200 µg/ml Ampicillin. Cells
were grown at 37°C to OD600=0.4, at which point the temperature was lowered to 15°C
and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, cells
(12 g) were collected by centrifugation and resuspended in 100 ml of 50 mM Tris-HCl
(pH 7.5)/10% Sucrose/1X Heat Lysis Buffer/1 M NaCl/5 mM DTT/30 mM
Spermidine/5 mM EDTA. Cells were lysed by two passages through the French press
(15,000 psi), followed by centrifugation at 13,000 rpm for 30 min. Ammonium
sulfate was added to the supernatant to a final concentration of 0.226 g/ml. The
precipitate was collected by centrifugation. The protein pellet (220 mg resuspended in
buffer A) was dialyzed against buffer A containing 150 mM NaCl, loaded onto an 8
ml FFQ column equilibrated with buffer A containing 150 mM NaCl, and eluted with
a linear gradient of buffer A containing 150-600 mM NaCl. The fractions containing
the least contaminants (fr# 42-64) were combined and precipitated with ammonium
sulfate (0.226 g/ml). The precipitate was collected by centrifugation and resuspended
in buffer A (10 mg/ml in 5 ml). A fraction (1 ml = 10 mg) of the concentrated protein
was dialyzed, loaded onto a 10 ml ssDNA-agarose column, and eluted with a linear
gradient of 50-500 mM NaCl. The peak fractions (fr# 30-50) were combined and
concentrated with ammonium sulfate (as above). The final alpha-large subunit
preparation is shown in the corresponding lane of the Coomassie Blue stained SDS-
polyacrylamide gel of Figure 14. Yield = 4 mg.

Example 28 - Identification of the S. pyogenes dnaE Gene and Purification of the
Alpha-Small Polymerase

A search of the B. subtilis genome database using the E. coli alpha
subunit amino acid sequence revealed two DNA sequences with a high scoring match
which predicted two genes encoding alpha-like polymerase subunits. The DNA
sequence with the highest scoring match encodes a smaller alpha polymerase which
does not contain an exonuclease domain. The putative short alpha DNA sequence is
a product of the open reading frame in contig #253 of the S. pyogenes genome
database. A set of PCR primers to the 5'- and 3'-termini of the putative gene sequence
was designed to include restriction enzyme recognition sequences for NdeI and
BamHI, respectively. The primers were:



α-short Upstream (SEQ. ID. No. 79)

gggaacaaga taaccaagga ggaacccatg gttgctcaac ttg 43

α-short Downstream (SEQ. ID. No. 80)

cgaatagcag cgttcatacc aggatcctcg ccgccactgg 40 

A PCR product obtained with these primers was digested with NdeI and BamHI,
purified and ligated into the pET11a vector to produce pET11a.S.p.dnaE.

The pET11a.S.p.dnaE plasmid was transformed into the
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation
was used to inoculate 12 L of LB broth supplemented with 200 µg/ml Ampicillin. Cells
were grown at 37°C to OD600=0.4, at which point the temperature was lowered to 15°C
and 0.5 mM IPTG was added. Induction proceeded for 16 hrs. In the morning, cells
were collected by centrifugation and resuspended in 100 ml of 50 mM Tris-HCl (pH
7.5)/10% Sucrose/1X Heat Lysis Buffer/5 mM DTT/30 mM Spermidine/1 M NaCl/5
mM EDTA. Cells were lysed by two passages through the French press (15,000 psi),
followed by centrifugation at 13,000 rpm for 30 min. Ammonium sulfate was added
to the supernatant to a final concentration of 0.226 g/ml. The precipitate was
collected by centrifugation. The protein pellet (resuspended in buffer A) was then
dialyzed against buffer A. The dialyzed protein (600 mg) was loaded onto a 30 ml
FFQ column and eluted with a linear gradient of buffer A containing 50-500 mM NaCl.
The peak fractions (200 mg in fr# 70-79) were dialyzed and loaded onto a 100 ml
Heparin-Affigel column. The fractions containing the least contaminants (100 mg
from fr# 18-30) were pooled and dialyzed against buffer A containing 300 mM NaCl.
The dialysate (50 mg) was loaded onto a 50 ml ssDNA-agarose column and eluted
with a linear gradient of 300 mM - 1 M NaCl. The final alpha-small subunit
preparation is shown in the lane marked αS of the Coomassie Blue stained SDS-
polyacrylamide gel of Figure 14. Yield = 25 mg.

Example 29 - Identification of the S. pyogenes ssb Gene and Purification of the 
Single Strand DNA-Binding Protein 

A search of the S. pyogenes genome using the B. subtilis SSB amino acid
sequence identified a polypeptide in contig #230(212) as having highest homology to
single strand binding protein of several Gram negative bacteria. This contig lacked the
first 26 amino acids at the N-terminus. Circular PCR was employed to identify the
DNA encoding the N-terminus of the putative SSB protein. S. pyogenes genomic
DNA was digested overnight with ApoI (5 µg chromosomal DNA in a 50 µl reaction).
The DNA was extracted with phenol and precipitated with ethanol. The ApoI
digested chromosomal DNA was self-ligated to generate a circular template for
use in the circular PCR. Circular PCR was performed with primers designed to
anneal back-to-back to amplify circularized ApoI reaction fragments. The primers
were:



ssb.circ Upstream (SEQ. ID. No. 81) 

accattttgg cttttaaagg tacggttaac agcaagtgtg aaggtagcc 49

ssb.circ Downstream (SEQ. ID. No. 82)

gaacgcgagg cagatttcat taactgtgtg atctggcg 38

The PCR reaction conditions were as follows: 100 ng circularized S. pyogenes
genomic DNA, 20 ng each primer, 1 µl Elongase, 60 µM each dNTP, 100 µl
Elongase reaction buffer. Amplification was performed for 40 cycles as follows:
denature, 1 min at 94°C; anneal, 1 min at 55°C; and extend, 5 min at 68°C. PCR
products were cloned into the Topo TA vector following instructions of the
manufacturer (Promega). Several positive clones were sequenced to obtain the
N-terminal nucleotide sequence. This information led to the design of the following
primers, with which a standard PCR reaction generated whole ssb gene
products. The primers were:



ssb Upstream (SEQ. ID. No. 83) 
tttaaaagag ggtagcatat gattaataat gtagtactag ttggtcgc 48

ssb Downstream (SEQ. ID. No. 84) 

tttaaattta aacctaggtt caatccattc tgactagaat ggaagatcgt c 51 

The resulting PCR product was digested with NdeI and BamHI, purified and ligated
into the pET11a vector to produce pET11a.S.p.ssb.

The pET11a.S.p.ssb plasmid was transformed into the
BL21(DE3)RecA- strain of E. coli. A single colony from an overnight transformation
was used to inoculate 12 L of LB broth supplemented with 200 µg/ml Ampicillin. Cells
were grown at 37°C to OD600=0.55, at which point 0.5 mM IPTG was added. At the
end of the 3 hr induction, cells were collected by centrifugation and resuspended in
100 ml of 50 mM Tris-HCl (pH 7.5)/10% Sucrose/1X Heat Lysis Buffer/5 mM
DTT/5 mM EDTA. Cell lysis was initiated upon addition of 0.4 mg/ml lysozyme
followed by a 1 hr incubation on ice. The lysate was clarified by centrifugation at
13,000 rpm for 30 min. The SSB protein was significantly purified by sequential
fractionation with ammonium sulfate in the following manner. Solid ammonium
sulfate was added to the clarified lysate to a final concentration of 0.24 g/ml and the
precipitated protein was collected by centrifugation at 13,000 rpm for 30 min. The
resulting pellet was homogenized in buffer A (0.1 M NaCl) + 0.24 g/ml ammonium
sulfate and the precipitate was collected by centrifugation. This procedure was
repeated with buffer A (0.1 M NaCl) + 0.2 g/ml ammonium sulfate, buffer A (0.1 M
NaCl) + 0.15 g/ml ammonium sulfate, and buffer A (0.1 M NaCl) + 0.13 g/ml
ammonium sulfate. The final pellet was resuspended in buffer A + 0.15 M NaCl and
dialyzed against the same buffer. The resulting pellet was resuspended in buffer A
and dialyzed against buffer A containing 500 mM NaCl. The dialysate (300 mg) was
diluted to 0.15 M NaCl before it was loaded onto a 20 ml MonoQ column and eluted
with a linear gradient of 0.15 M - 0.5 M NaCl in buffer A. The SSB protein elutes at
the very beginning of the gradient. The peak fractions were combined (150 mg in
fractions 16-30), diluted to 0.05 M NaCl, loaded onto a 10 ml ssDNA-agarose
column, and eluted with 0.5 M NaCl. The peak fractions (32-62) were combined and
frozen. The SSB was further purified over a MonoQ column to remove contaminating
polymerase activity. The final single strand DNA binding protein preparation is shown
in the lane marked ssb of the Coomassie Blue stained SDS-polyacrylamide gel of Figure
14. Yield = 120 mg.

Example 30 - First Demonstration that S. pyogenes holA Encodes a Delta Subunit
Involved In Replication: Assembly of τδδ' Complex

Gel filtration is a standard analytical technique to demonstrate direct
protein-protein interaction. Purified τ, δ, and δ' proteins were used to examine whether
they form a protein complex assembly. Gel filtration of τ mixed with either δ, δ', or
both δ and δ' was performed using an HR 10/30 Superose 6 column equilibrated with
buffer A containing 100 mM NaCl. Either δ (200 µg), δ' (200 µg), or a mixture
of δ and δ' (200 µg each) was incubated for 30 min at 15°C in 100 µl of buffer A
containing 100 mM NaCl, and the entire mixture was injected onto the column. The
mixture was resolved on the column by collection of 170 µl fractions after the initial
void (6.6 ml) volume was collected. Fractions were analyzed by 10% SDS-
polyacrylamide gels (30 µl/lane) stained with Coomassie Blue.

The results, in Figure 15, demonstrate that under these conditions
the τ protein exhibits no (or only weak) interaction with the delta (Figure 15B) and the
delta prime (Figure 15C) subunits individually, and yet assembles readily into a complex
when all the subunits are mixed in the reaction (Figure 15A). The τ protein was
mixed with a 2-fold molar excess of each of δ and δ', then gel filtered. A complex
of τδδ' was formed as demonstrated by coelution of δ and δ' with τ (fr# 22-30), whereas
excess δδ' complex elutes in later fractions (fr# 38-46). To determine whether
individual δ or δ' subunits interact with τ, the τ subunit was mixed with either δ or δ'
and then gel filtered. The results demonstrate that a gel filterable complex does not
form when τ is mixed with δ (Figure 15B) or δ' (Figure 15C) subunits individually, as
indicated by the absence of these subunits in the τ containing fractions (fr# 20-26).
Therefore, it appears that the presence of both δ and δ' subunits is essential for the
formation of the τδδ' complex.
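
The fraction numbers quoted above can be related to elution volume with simple arithmetic. The following is a minimal sketch, assuming that fraction 1 begins immediately after the 6.6 ml void volume and that every fraction is exactly 170 µl; the function names are hypothetical and are not part of the original disclosure.

```python
VOID_VOLUME_ML = 6.6        # volume collected before fractionation began
FRACTION_VOLUME_ML = 0.170  # 170 µl per fraction


def fraction_window_ml(fraction: int) -> tuple[float, float]:
    """Elution-volume window (start, end) in ml for a 1-based fraction number.

    Assumption: fraction 1 starts exactly where the void volume ends.
    """
    if fraction < 1:
        raise ValueError("fraction numbers are 1-based")
    start = VOID_VOLUME_ML + (fraction - 1) * FRACTION_VOLUME_ML
    return start, start + FRACTION_VOLUME_ML


def pool_window_ml(first: int, last: int) -> tuple[float, float]:
    """Elution-volume window covered by a pool of consecutive fractions."""
    if last < first:
        raise ValueError("last fraction must not precede the first")
    return fraction_window_ml(first)[0], fraction_window_ml(last)[1]


if __name__ == "__main__":
    for label, (first, last) in {"complex": (22, 30), "excess subunits": (38, 46)}.items():
        start, end = pool_window_ml(first, last)
        print(f"{label}: fractions {first}-{last} span {start:.2f}-{end:.2f} ml")
```

Under these assumptions, fractions 22-30 correspond to roughly 10.2-11.7 ml of eluate and fractions 38-46 to roughly 12.9-14.4 ml, which makes the separation between the complex and the excess subunits easier to compare across experiments that use the same column.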


Example 31 - Second Demonstration that S. pyogenes holA Encodes Delta:
Functional Assembly of β on DNA

Gel filtration was used to demonstrate that the τ, δ, and δ' proteins form a
functional clamp loading complex which is able to load the β clamp onto a circular
DNA molecule. The reaction contained 0.5 pmol of gp2 nicked pBluescript plasmid
(a circular double strand plasmid with a single nick produced by M13 gp2 protein), 1
pmol [32P]β, and 0.5 pmol τδδ' complex in 75 µl buffer B (20 mM Tris-HCl (pH 7.5),
20% glycerol, 0.1 mM EDTA, 5 mM DTT, 2 mM ATP, 8 mM MgCl2). In individual
experiments in which a subassembly of the complex was tested (τδ, τδ', δδ'), 0.25 pmol
of the relevant δ, δ', or τ subunits was used. β was incubated with nicked DNA for
10 min at 37°C either alone, or in combination with various assemblies of the τ complex.
All gel filtration experiments were performed at 4°C. The reaction mixtures were
applied to a 5 ml column of Bio-Gel 15M (Bio-Rad) equilibrated in buffer B containing
100 mM NaCl. Fractions of 170 µl were collected and quantitated in the scintillation
counter.

The results, in Figure 16, demonstrate that the assembly of the β ring
onto a circular DNA molecule requires the presence of τ, δ, and δ' proteins
(Figure 16A). In the absence of any one of the subunits, loading onto DNA does not
occur (Figure 16B-E). The clamp loader complex (τδδ') can be supplied as a mixture
of τ, δ, and δ' subunits or as an assembled complex (purified from unassembled subunits by
gel filtration, or by ion exchange chromatography on MonoQ). Proteins bound to the
large DNA molecule elute in the early fractions (void fr# 10-17) and resolve from free
proteins that elute in later fractions (fr# 18-35).

Example 32 - The τ Subunit Product of the dnaX Gene Binds α-large



The interaction of S. pyogenes α and τ proteins was examined by
analyzing a mixture of the proteins by gel filtration. Gel filtration of τ, α-large or a
mixture of α-large and τ was performed using an HR 10/30 Superose 6 column
equilibrated with buffer A containing 100 mM NaCl. Either α-large (400 µg) (200
µM) or a mixture of α-large and τ was incubated for 30 min at 15°C in 100 µl of
buffer A containing 100 mM NaCl, and the entire mixture was injected onto the
column. The mixture was resolved on the column by collection of 170 µl fractions
after the initial void (6.6 ml) volume was collected. Fractions were analyzed by 10%
SDS-polyacrylamide gels (30 µl/lane) stained with Coomassie Blue.

The results show that a complex of α-large and τ was formed, as demonstrated by
coelution of α-large and τ (fr# 30-38) proteins (Figure 17A) compared to the elution
profile of the individual proteins (Figure 17B-C). Also, the migration of τ in the
α-large·τ complex changes significantly to that of a larger complex (4 fractions, from
fr# 37 to fr# 33).

Example 33 - Formation of α-large·τδδ' Complex

To determine whether an α-large·τδδ' complex could form, the following
components were mixed: α-large (400 µg, 2.5 nmol), τ (200 µg, 1.3 nmol), δ (200 µg,
4.8 nmol), δ' (200 µg, 5.75 pmol) in a final volume of 150 µl. The mixture was diluted
to 300 µl with buffer A to lower the conductivity of the sample to that equivalent of 100
mM NaCl and incubated for 30 min at 15°C. The mixture was injected onto a
Superose 6 column (equilibrated with buffer A containing 100 mM NaCl) and
fractions (170 µl) were collected after an initial 6.6 ml of void volume was collected.
Fractions were analyzed by 10% SDS-polyacrylamide gels (30 µl/lane) stained with
Coomassie Blue.

A gel filterable complex (Figure 18A) of α-large·τδδ' was formed as
demonstrated by coelution of τ, δ and δ' with α-large (fr# 14-26), whereas
excess δδ' complex elutes in later fractions (fr# 30-38). The migration of
the τδδ' protein complex in the α-large·τδδ' complex does not change significantly. The
complex might dissociate under the nonequilibrium conditions of gel filtration due to
low concentration of proteins, salt concentration and speed of resolution.

Next, ion exchange chromatography was used to analyze the protein
mixture and to prepare the reconstituted α-large·τδδ' complex of S. pyogenes. The
α-large·τδδ' complex was reconstituted upon mixing α-large (10 mg, 62 nmol), τ (6 mg,
72 nmol), δ (3.3 mg, 80 nmol), and δ' (1.6 mg, 90 nmol). The α, τ, δ, δ' protein mixture
was dialyzed for 2 hrs against buffer A containing 50 mM NaCl. The entire mixture was
loaded onto a 1 ml MonoQ column equilibrated in buffer A containing 50 mM NaCl.
Proteins were eluted with a 20 column volume linear gradient of 50-500 mM NaCl in
buffer A and 0.25 ml fractions were collected. Fractions were analyzed by 10% SDS-
polyacrylamide gels (20 µl/lane) stained with Coomassie Blue.

Generally, the reconstitution of the α-large·τδδ' complex on a MonoQ
column results in a tight salt resistant complex (Figure 18B, fr# 23-35) which elutes at
500 mM NaCl. The high concentration of the proteins in the eluted fractions
contributes to stability of the complex.



Example 34 - The S. pyogenes Three Component Pol III-L Polymerase Is Rapid 
and Processive In DNA Synthesis 

It was previously demonstrated (i.e., in Examples 30 and 31) that the
putative delta subunit plays an integral part in the assembly of the τδδ' complex
(Figure 15) and that this complex is sufficient to assemble β clamps onto circular
primed DNA (Figure 16). It was also shown that the strong interaction between the
α-large and τ subunits (Figure 17) results in an isolatable α-large·τδδ' complex
(Figure 18), similar to that of the E. coli DNA polymerase III*.

The MonoQ fractions containing α-large·τδδ' complex were then used to
assemble β onto primed DNA and determine whether this now resulted in rapid and
processive DNA synthesis. Replication reactions contained 70 ng of singly primed
M13mp18 ssDNA and 0.82 µg of S. pyogenes SSB in 25 µl buffer C (20 mM Tris-
HCl (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5 mM DTT, 2 mM ATP, 8 mM MgCl2)
with 60 µM each of dGTP, dCTP, and dATP, 30 µM cold TTP and 20 µM [α-32P]
TTP (specific activity of 2,000-4,000 cpm/pmol). The complex is assembled onto
DNA in the following manner: 40 ng (3:1) or 140 ng (10:1) of the α-large·τδδ' complex
and 60 ng of β protein were preincubated for 2 min at 30°C in the presence of SSB coated
primed M13 DNA and two nucleotides (dCTP and dGTP). Reactions were initiated by
addition of the two remaining nucleotides, dATP and TTP, and quenched with an equal
volume of 1% SDS/40 mM EDTA. Each time point is a separate reaction.

A time course of replication on singly primed circular M13mp18
ssDNA is shown in Figure 19. The agarose gel analysis shows conversion of the
oligonucleotide primed single stranded DNA to the slower migrating replicative form
II. The fact that the speed of synthesis is independent of the concentration of
polymerase in the reaction indicates that the α-large·τδδ' complex synthesizes DNA in a
rapid and highly processive manner. The S. pyogenes α-large·τδδ' complex, in the
presence of the β clamp, is able to complete replication of the 7250 nt of
M13mp18 ssDNA in 8-9 sec.
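
The replication time quoted above implies an average synthesis rate, which can be worked out directly. This is a minimal sketch that assumes a single polymerase copies the entire 7250 nt template without dissociating and that the 8-9 sec figure covers synthesis only; the function name is hypothetical.

```python
TEMPLATE_LENGTH_NT = 7250         # length of the M13mp18 ssDNA template
COMPLETION_TIME_RANGE_S = (8, 9)  # time to complete replication, in seconds


def average_rate_nt_per_s(length_nt: int, seconds: float) -> float:
    """Average synthesis rate, assuming one polymerase copies the whole template."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    return length_nt / seconds


fastest = average_rate_nt_per_s(TEMPLATE_LENGTH_NT, COMPLETION_TIME_RANGE_S[0])
slowest = average_rate_nt_per_s(TEMPLATE_LENGTH_NT, COMPLETION_TIME_RANGE_S[1])
print(f"average rate: {slowest:.0f}-{fastest:.0f} nt/s")
# prints: average rate: 806-906 nt/s
```

Under these assumptions the complex synthesizes DNA at roughly 800-900 nucleotides per second.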

Example 35 - The S. pyogenes DnaE (α-small) Forms a Three-Component
Polymerase with τδδ' and β

The S. pyogenes DnaE (α-small) polymerase is more homologous to E. coli α
than S. pyogenes PolC. Thus, it seems reasonable to expect that the DnaE polymerase
may also function with the β clamp (Figs. 21A-B). To test DnaE for function with
τδδ' and β, replication reactions contained 70 ng (25 fmol) of 30-mer singly primed
M13mp18 ssDNA, 0.82 µg of S. pyogenes SSB, and 3.3 ng - 300 ng of DnaE (25 fmol
- 2.3 pmol) in 23.5 µl of 20 mM Tris-HCl (pH 7.5), 4% glycerol, 0.1 mM EDTA, 5
mM dithiothreitol (DTT), 40 µg/ml BSA, 2 mM ATP, 8 mM MgCl2, and 60 µM each
of dGTP and dCTP. When present, reactions included 43.3 ng of β and 10 ng of τδδ'.
Reactions were preincubated for 3 min at 37°C, and then NaCl was added to 40 mM
followed by another 2 min at 37°C. DNA synthesis was initiated upon addition of 1.5
µl of 1.5 mM dATP, 0.5 mM [α-32P]-dTTP (specific activity 2,000-4,000 cpm/pmol).
Aliquots of 25 µl were removed at the indicated times and quenched with an equal
volume (25 µl) of 1% SDS, 40 mM EDTA. One-half of the quenched reaction was
analyzed for total deoxynucleotide incorporation using DE81 filter paper and the other
half was analyzed on a 0.8% neutral agarose gel. The effect of TMAU was also
examined, in which 100 µM TMAU in DMSO (2% DMSO final concentration) was
present. In this case, replication was allowed to proceed for 1 min before being
quenched with 25 µl of 1% SDS, 40 mM EDTA.
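
As a consistency check on the quantities above, the paired mass and molar amounts given for DnaE (3.3 ng as 25 fmol, and 300 ng as 2.3 pmol) can be converted into the molar mass they imply. This is a minimal sketch; the text does not state a molecular mass for DnaE, so the result is only what the paired numbers imply, and the function name is hypothetical.

```python
NG = 1e-9     # grams per nanogram
FMOL = 1e-15  # moles per femtomole
PMOL = 1e-12  # moles per picomole


def implied_molar_mass(mass_g: float, amount_mol: float) -> float:
    """Molar mass in g/mol implied by a mass and the molar amount quoted for it."""
    if amount_mol <= 0:
        raise ValueError("amount_mol must be positive")
    return mass_g / amount_mol


low_end = implied_molar_mass(3.3 * NG, 25 * FMOL)    # lowest DnaE amount used
high_end = implied_molar_mass(300 * NG, 2.3 * PMOL)  # highest DnaE amount used
print(f"{low_end / 1000:.0f} kDa, {high_end / 1000:.0f} kDa")
```

Both ends of the titration imply a molar mass of roughly 130 kDa, so the mass and molar figures quoted for DnaE are mutually consistent.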

At a saturating concentration of DnaE polymerase, the time course of primer
extension shows that it completes an M13mp18 primed ssDNA template within 2
minutes for a speed of at least 60 nucleotides/s (Fig. 21C). This rate of synthesis
holds true for the highest amount of DnaE in the rightmost panel of the figure. As the
DnaE concentration is decreased, a longer time is required to complete the circular
template, indicating that the DnaE polymerase is not processive over the entire length
of the M13mp18 template. If the DnaE polymerase were fully processive during
synthesis of the 7.2 kb ssDNA circle, the product profile over time would be
qualitatively similar at all concentrations of enzyme, but the overall intensity of the
profile would be diminished. This particular experiment was performed in the
absence of β, but in the presence of τδδ'. When repeated in the presence of β but without
τδδ', and in the absence of both β and τδδ', results similar to those shown in Fig. 21C
were observed.

In the presence of β and τδδ', DnaE polymerase is stimulated in synthesis at
low concentration, indicating that β increases the processivity and/or speed of DnaE
(Figs. 21C-D). At higher concentrations of DnaE, the presence of β/τδδ' has no effect
on the rate of synthesis, and thus β does not increase the intrinsic speed of the enzyme
(i.e., panels 3 and 4 of Fig. 21D). Hence, the effect of the β clamp on DnaE is
primarily due to an increase in processivity. The profile of product length over time
remains essentially unchanged at the different DnaE concentrations, and therefore the
processivity of DnaE with β is at least equal to the 7.2 kb length of the M13mp18
substrate.

The DnaE sequence does not show homology to an exonuclease, implying that
it may have no associated nuclease activity. The DnaE preparation was examined for
the presence of a 3'-5' exonuclease (Fig. 21E). The DnaE and PolC polymerases were
each incubated with a 5' 32P-labeled oligonucleotide, followed by analysis in a
sequencing gel. The result showed no degradation of the oligonucleotide by DnaE.
PolC is a known 3'-5' exonuclease and it digests the end-labeled oligonucleotide as
expected.

Gram positive PolC is known to be inhibited by the antibiotic
hydroxyphenylaza-uracil ("HPUra") and its derivatives. In Fig. 21F, the PolC·τδδ'/β
and DnaE were tested for inhibition of synthesis on SSB coated primed M13mp18
ssDNA by an HPUra derivative, trimethylanilino-uracil ("TMAU"). The PolC·τδδ'/β
enzyme was prevented from forming the RFII product by TMAU. In contrast, the
DnaE polymerase was not affected by TMAU in the presence of τδδ'/β (nor in the
absence of τδδ'/β, not shown).

Although the invention has been described in detail for the purpose of 
illustration, it is understood that such detail is solely for that purpose, and variations 
can be made therein by those skilled in the art without departing from the spirit and 
scope of the invention which is defined by the following claims.

WHAT IS CLAIMED:

1. An isolated DNA molecule from a Gram positive bacterium,
the isolated DNA molecule comprising a coding region from a polC gene, a dnaE
gene, a holA gene, a holB gene, a dnaX gene, a dnaN gene, an ssb gene, a dnaG gene, or
a dnaB gene.

2. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the polC gene. 

3. The isolated DNA molecule according to claim 2, wherein the
Gram positive bacterium is Streptococcus pyogenes.

4. An isolated DNA molecule according to claim 3, wherein the
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 18.

5. The isolated DNA molecule according to claim 4, wherein the
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 17.

6. The isolated DNA molecule according to claim 2, wherein the
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 17 under
stringent conditions characterized by use of a hybridization buffer comprising 0.9M
SSC buffer at a temperature of 37°C.

7. The isolated DNA molecule according to claim 1, wherein the
DNA molecule comprises the coding region from the dnaE gene.

8. The isolated DNA molecule according to claim 7, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 



9. The isolated DNA molecule according to claim 8, wherein the
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 20.

10. The isolated DNA molecule according to claim 9, wherein the
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 19.

11. The isolated DNA molecule according to claim 7, wherein the
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 19 under
stringent conditions characterized by use of a hybridization buffer comprising 0.9M
SSC buffer at a temperature of 37°C.

12. The isolated DNA molecule according to claim 1, wherein the
DNA molecule comprises the coding region from the holA gene.

13. The isolated DNA molecule according to claim 12, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

14. The isolated DNA molecule according to claim 13, wherein the
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 22.

15. The isolated DNA molecule according to claim 14, wherein the
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 21.

16. The isolated DNA molecule according to claim 12, wherein the
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 21 under
stringent conditions characterized by use of a hybridization buffer comprising 0.9M
SSC buffer at a temperature of 37°C.

17. The isolated DNA molecule according to claim 12, wherein the 
Gram positive bacterium is Staphylococcus aureus. 



18. The isolated DNA molecule according to claim 17, wherein the
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 12.

19. The isolated DNA molecule according to claim 18, wherein the
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 11.

20. The isolated DNA molecule according to claim 12, wherein the
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 11 under
stringent conditions characterized by use of a hybridization buffer comprising 0.9M
SSC buffer at a temperature of 37°C.

21. The isolated DNA molecule according to claim 1, wherein the
DNA molecule comprises the coding region from the holB gene.

22. The isolated DNA molecule according to claim 21, wherein the
Gram positive bacterium is Streptococcus pyogenes.

23. The isolated DNA molecule according to claim 22, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 24. 

24. The isolated DNA molecule according to claim 23, wherein the
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 23.

25. The isolated DNA molecule according to claim 21, wherein the
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 23 under
stringent conditions characterized by use of a hybridization buffer comprising 0.9M
SSC buffer at a temperature of 37°C.

26. The isolated DNA molecule according to claim 21, wherein the
Gram positive bacterium is Staphylococcus aureus.

27. The isolated DNA molecule according to claim 26, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 14. 



28. The isolated DNA molecule according to claim 27, wherein the
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 13.

29. The isolated DNA molecule according to claim 21, wherein the
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 13 under
stringent conditions characterized by use of a hybridization buffer comprising 0.9M
SSC buffer at a temperature of 37°C.

30. The isolated DNA molecule according to claim 1, wherein the
DNA molecule comprises the coding region from the dnaX gene.

31. The isolated DNA molecule according to claim 30, wherein the
Gram positive bacterium is Streptococcus pyogenes.

32. The isolated DNA molecule according to claim 31, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 26. 

33. The isolated DNA molecule according to claim 32, wherein the
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 25.

34. The isolated DNA molecule according to claim 30, wherein the
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 25 under
stringent conditions characterized by use of a hybridization buffer comprising 0.9M
SSC buffer at a temperature of 37°C.

35. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the dnaN gene. 

36. The isolated DNA molecule according to claim 35, wherein the
Gram positive bacterium is Streptococcus pyogenes.

37. The isolated DNA molecule according to claim 36, wherein the
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 28.

38. The isolated DNA molecule according to claim 37, wherein the
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 27.

39. The isolated DNA molecule according to claim 35, wherein the
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 27 under
stringent conditions characterized by use of a hybridization buffer comprising 0.9M
SSC buffer at a temperature of 37°C.

40. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the ssb gene. 

41. The isolated DNA molecule according to claim 40, wherein the
Gram positive bacterium is Streptococcus pyogenes.

42. The isolated DNA molecule according to claim 41, wherein the
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 30.

43. The isolated DNA molecule according to claim 42, wherein the
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 29.

44. The isolated DNA molecule according to claim 40, wherein the
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 29 under
stringent conditions characterized by use of a hybridization buffer comprising 0.9M
SSC buffer at a temperature of 37°C.

45. The isolated DNA molecule according to claim 1, wherein the
DNA molecule comprises the coding region from the dnaG gene.

46. The isolated DNA molecule according to claim 45, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 
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47. The isolated DNA molecule according to claim 46, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 32. 
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48. The isolated DNA molecule according to claim 47, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 31. 

49. The isolated DNA molecule according to claim 45, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 31 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

50. The isolated DNA molecule according to claim 1, wherein the 
DNA molecule comprises the coding region from the dnaB gene. 

51. The isolated DNA molecule according to claim 50, wherein the 
Gram positive bacterium is Streptococcus pyogenes. 

52. The isolated DNA molecule according to claim 51, wherein the 
DNA molecule encodes an amino acid sequence comprising SEQ. ID. No. 34. 

53. The isolated DNA molecule according to claim 52, wherein the 
DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 33. 

54. The isolated DNA molecule according to claim 50, wherein the 
DNA molecule hybridizes to a nucleic acid molecule of SEQ. ID. No. 33 under 
stringent conditions characterized by use of a hybridization buffer comprising 0.9M 
SSC buffer at a temperature of 37°C. 

55. An expression system comprising an expression vector into 
which is inserted a heterologous DNA molecule according to claim 1. 

56. The expression system according to claim 55, wherein the 
heterologous DNA molecule is in sense orientation and correct reading frame. 

57. A host cell comprising a heterologous DNA molecule 
according to claim 1. 



58. An isolated protein or polypeptide from a Gram positive 
bacterium, wherein the isolated protein or polypeptide is alpha-large, alpha-small, 
delta, delta prime, tau, beta, SSB, DnaG, or DnaB. 

59. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is alpha-large. 



60. The isolated protein or polypeptide according to claim 59, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

61. The isolated protein or polypeptide according to claim 60, 
wherein the alpha-large protein or polypeptide comprises an amino acid sequence of 
SEQ. ID. No. 18. 

62. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is alpha-small. 

63. The isolated protein or polypeptide according to claim 62, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

64. The isolated protein or polypeptide according to claim 63, 
wherein the alpha-small protein or polypeptide comprises an amino acid sequence of 
SEQ. ID. No. 20. 

65. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is delta. 

66. The isolated protein or polypeptide according to claim 65, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

67. The isolated protein or polypeptide according to claim 66, 
wherein the delta protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 22. 

68. The isolated protein or polypeptide according to claim 65, 
wherein the Gram positive bacterium is Staphylococcus aureus. 

69. The isolated protein or polypeptide according to claim 68, 
wherein the delta protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 12. 

70. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is delta prime. 

71. The isolated protein or polypeptide according to claim 70, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

72. The isolated protein or polypeptide according to claim 71, 
wherein the delta prime protein or polypeptide comprises an amino acid sequence of 
SEQ. ID. No. 24. 

73. The isolated protein or polypeptide according to claim 70, 
wherein the Gram positive bacterium is Staphylococcus aureus. 

74. The isolated protein or polypeptide according to claim 73, 
wherein the delta prime protein or polypeptide comprises an amino acid sequence of 
SEQ. ID. No. 14. 

75. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is tau. 



76. The isolated protein or polypeptide according to claim 75, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 



77. The isolated protein or polypeptide according to claim 76, 
wherein the tau protein or polypeptide comprises an amino acid sequence of SEQ. ID. 
No. 26. 

78. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is beta. 



79. The isolated protein or polypeptide according to claim 78, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

80. The isolated protein or polypeptide according to claim 79, 
wherein the beta protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 28. 

81. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is SSB. 

82. The isolated protein or polypeptide according to claim 81, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

83. The isolated protein or polypeptide according to claim 82, 
wherein SSB comprises an amino acid sequence of SEQ. ID. No. 30. 

84. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is DnaG. 

85. The isolated protein or polypeptide according to claim 84, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 



86. The isolated protein or polypeptide according to claim 85, 
wherein the DnaG protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 32. 

87. The isolated protein or polypeptide according to claim 58, 
wherein the isolated protein or polypeptide is DnaB. 

88. The isolated protein or polypeptide according to claim 87, 
wherein the Gram positive bacterium is Streptococcus pyogenes. 

89. The isolated protein or polypeptide according to claim 88, 
wherein the DnaB protein or polypeptide comprises an amino acid sequence of SEQ. 
ID. No. 34. 

90. A method of identifying compounds which inhibit the activity 
of a polymerase product of polC or dnaE comprising: 

forming a reaction mixture comprising a primed DNA molecule, a 
polymerase product of polC or dnaE, a candidate compound, a dNTP, and optionally 
either a beta subunit, a tau complex, or both the beta subunit and the tau complex, 
wherein at least one of the polymerase product of polC or dnaE, the beta subunit, the 
tau complex, or a subunit or combination of subunits thereof is derived from a 
Eubacteria other than Escherichia coli; 

subjecting the reaction mixture to conditions effective to achieve 
nucleic acid polymerization in the absence of the candidate compound; 

analyzing the reaction mixture for the presence or absence of nucleic 
acid polymerization extension products; and 

identifying the candidate compound in the reaction mixture where there 
is an absence of nucleic acid polymerization extension products. 
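
The identification step of claim 90 can be illustrated with a short sketch. It is not part of the claims. It assumes that each reaction mixture yields a single numeric extension-product signal and that "absence" is judged against a reaction run without any candidate compound. The names `AssayReading` and `flag_inhibitors`, the compound labels, and the 10% cutoff are hypothetical choices made only for illustration.

```python
from dataclasses import dataclass


@dataclass
class AssayReading:
    compound: str   # hypothetical candidate-compound label
    signal: float   # extension-product signal, arbitrary units


def flag_inhibitors(readings, control_signal, background=0.0, max_fraction=0.1):
    """Return compounds whose extension-product signal is effectively absent.

    "Absent" is taken here (an assumption, not a claim term) to mean that the
    background-corrected signal is at most `max_fraction` of the signal from
    the reaction run without any candidate compound.
    """
    control = control_signal - background
    if control <= 0:
        raise ValueError("control reaction shows no polymerization; assay invalid")
    hits = []
    for reading in readings:
        corrected = max(reading.signal - background, 0.0)
        if corrected / control <= max_fraction:
            hits.append(reading.compound)
    return hits


readings = [
    AssayReading("compound_A", 980.0),
    AssayReading("compound_B", 55.0),
    AssayReading("compound_C", 12.0),
]
print(flag_inhibitors(readings, control_signal=1000.0, background=10.0))
```

The check on the control reaction mirrors the requirement that the conditions be effective to achieve polymerization in the absence of the candidate compound. Without a working control, a missing product says nothing about inhibition.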



91. The method according to claim 90, wherein the polymerase 
product of polC or dnaE is from a Streptococcus bacterium or a Staphylococcus 
bacterium. 

FIGURE 20D 
FIGURE 20F 
FIGURE 20H 

SEQUENCE LISTING 
<110> The Rockefeller University 

<120> DNA REPLICATION PROTEINS OF GRAM POSITIVE BACTERIA AND 
THEIR USE TO SCREEN FOR CHEMICAL INHIBITORS 

<130> 22221/1022 

<140> 
<141> 

<150> 60/146,178 
<151> 1999-07-29 

<160> 84 

<170> PatentIn Ver. 2.1 

<210> 1 
<211> 3195 
<212> DNA 

<213> Staphylococcus aureus 
<400> 1 

atggtggcat atttaaatat tcatacggct tatgatttgt taaattcaag cttaaaaata 60 

gaagatgccg taagacttgc tgtgtctgaa aatgttgatg cacttgccat aactgacacc 120 

aatgtattgt atggttttcc taaattttat gatgcatgta tagcaaataa cattaaaccg 180 

atttttggta tgacaatata tgtgacaaat ggattaaata cagtcgaaac agttgttcta 240 

gctaaaaata atgatggatt aaaagatttg tatcaactat catcggaaat aaaaatgaat 300 

gcattagaac atgtgtcgtt tgaattatta aaacgatttt ctaacaatat gattatcatt 360 

tttaaaaaag tcggtgatca acatcgtgat attgtacaag tgtttgaaac ccataatgac 420 

acatatatgg accaccttag tatttcgatt caaggtagaa aacatgtttg gattcaaaat 480 

gtttgttacc aaacacgtca agatgccgat acgatttctg cattagcagc tattagagac 540 

aatacaaaat tagacttaat tcatgatcaa gaagattttg gtgcacattt tttaactgaa 600 

aaggaaatta atcaattaga tattaaccaa gaatatttaa cgcaggttga tgttatagct 660 

caaaagtgtg atgcagaatt aaaatatcat caatctctac ttcctcaata tgagacacct 720 

aatgatgaat cagctaaaaa atatttgtgg cgtgtcttag ttacacaatt gaaaaaatta 780 

gaacttaatt atgacgtcta tttagagcga ttgaaatatg agtataaagt tattactaat 840 

atgggttttg aagattattt cttaatagta agtgatttaa tccattatgc gaaaacgaat 900 

gatgtgatgg taggtcctgg tcgtggttct tcagctggct cactggtcag ttatttattg 960 

ggaattacaa cgattgatcc tattaaattc aatctattat ttgaacgttt tttaaaccca 1020 

gaacgtgtaa caatgcctga tattgatatt gactttgaag atacacgccg agaaagggtc 1080 

attcagtacg tccaagaaaa atatggcgag ctacatgtat ctggaattgt gactttcggt 1140 

catctgcttg caagagcagt tgctagagat gttggaagaa ttatggggtt tgatgaagtt 1200 

acattaaatg aaatttcaag tttaatccca cataaattag gaattacact tgatgaagca 1260 

tatcaaattg acgattttaa agagtttgta catcgaaacc atcgacatga acgctggttc 1320 

agtatttgta aaaagttaga aggtttacca agacatacat ctacacatgc ggcaggaatt 1380 

attattaatg accatccatt atatgaatat gcccctttaa cgaaagggga tacaggatta 1440 

ttaacgcaat ggacaatgac tgaagccgaa cgtattgggt tattaaaaat agattttcta 1500 

gggttgagaa acttatcgat tattcatcaa atcttaacac aagtcaaaaa agatttaggt 1560 

attaatattg atatcgaaaa gattccgttt gatgatcaaa aagtgtttga attgttgtcg 1620 

caaggagata cgactggcat attccaatta gagtctgacg gtgtaagaag tgtattaaaa 1680 

aaattaaagc cggaacactt tgaagatatt gttgctgtaa cttctttgta tagaccaggt 1740 

ccaatggaag aaattccaac ttacattaca agaagacatg atccaagcaa agttcaatat 1800 

ttacatccgc atttagaacc tatattaaaa aatacttacg gtgttattat ttatcaagag 1860 

caaattatgc aaatagcgag cacatttgca aacttcagtt atggtgaagc ggatatttta 1920 

agaagagcaa tgagtaaaaa aaatagagct gttcttgaaa gtgagcgtca acattttata 1980 

gaaggtgcaa agcaaaatgg ttatcacgaa gacattagta agcaaatatt tgatttgatt 2040 

ctgaaatttg ctgattatgg ttttcctaga gcacatgctg tcagctattc taaaattgca 2100 

tacattatga gctttttaaa agtccattat ccaaattatt tttacgcaaa tattttaagt 2160 

aatgttattg gaagtgagaa gaaaactgct caaatgatag aagaagcaaa aaaacaaggt 2220 

atcactatat tgccaccgaa cattaacgaa agtcattggt tttataaacc ttcccaagaa 2280 

ggcatttatt tatcaattgg tacaattaaa ggtgttggtt atcaaagtgt gaaagtgatt 2340 

gttgatgaac gttatcagaa cggcaaattt aaagatttct ttgattttgc tagacgtata 2400 

ccgaagagag tcaaaacgag aaagttactt gaagcactga ttttagtggg agcgtttgat 2460 

gcttttggta aaacacgttc aacgttgttg caagctattg atcaagtgtt ggatggcgat 2520 

ttaaacattg aacaagatgg ttttttattt gatattttaa cgccaaaaca gatgtatgaa 2580 

gataaagaag aattgcctga tgcacttatt agtcagtacg aaaaagaata tttaggattt 2640 

tatgtttcgc aacacccagt agataaaaag tttgttgcca aacaatattt aacgatattt 2700 

aaattgagta acgcgcagaa ttataaacct atattagtac agtttgataa agttaaacaa 2760 

attcgaacta aaaatggtca aaatatggca ttcgtcacat taaatgatgg cattgaaact 2820 

ttagatggtg tgattttccc taatcagttt aaaaagtacg aagagttgtt atcacataat 2880 

gacttgttta tagttagcgg gaaatttgac catagaaagc aacaacgtca actaattata 2940 

aatgagattc agacattagc cacttttgaa gaacaaaaat tagcatttgc caaacaaatt 3000 

ataattagaa ataaatcaca aatagatatg tttgaagaga tgattaaagc tacgaaagag 3060 

aatgctaatg atgttgtgtt atccttttat gatgaaacga ttaaacaaat gactacttta 3120 

ggctatatta atcaaaaaga tagtatgttt aataatttta tacaatcctt taaccctagt 3180 

gatattaggc ttata 3195 



<210> 2 
<211> 1065 
<212> PRT 

<213> Staphylococcus aureus 
<400> 2 

Met Val Ala Tyr Leu Asn Ile His Thr Ala Tyr Asp Leu Leu Asn Ser 
1 5 10 15 

Ser Leu Lys Ile Glu Asp Ala Val Arg Leu Ala Val Ser Glu Asn Val 
20 25 30 

Asp Ala Leu Ala Ile Thr Asp Thr Asn Val Leu Tyr Gly Phe Pro Lys 
35 40 45 

Phe Tyr Asp Ala Cys Ile Ala Asn Asn Ile Lys Pro Ile Phe Gly Met 
50 55 60 

Thr Ile Tyr Val Thr Asn Gly Leu Asn Thr Val Glu Thr Val Val Leu 
65 70 75 80 

Ala Lys Asn Asn Asp Gly Leu Lys Asp Leu Tyr Gln Leu Ser Ser Glu 
85 90 95 

Ile Lys Met Asn Ala Leu Glu His Val Ser Phe Glu Leu Leu Lys Arg 
100 105 110 

Phe Ser Asn Asn Met Ile Ile Ile Phe Lys Lys Val Gly Asp Gln His 
115 120 125 

Arg Asp Ile Val Gln Val Phe Glu Thr His Asn Asp Thr Tyr Met Asp 
130 135 140 

His Leu Ser Ile Ser Ile Gln Gly Arg Lys His Val Trp Ile Gln Asn 
145 150 155 160 

Val Cys Tyr Gln Thr Arg Gln Asp Ala Asp Thr Ile Ser Ala Leu Ala 
165 170 175 

Ala Ile Arg Asp Asn Thr Lys Leu Asp Leu Ile His Asp Gln Glu Asp 
180 185 190 

Phe Gly Ala His Phe Leu Thr Glu Lys Glu Ile Asn Gln Leu Asp Ile 
195 200 205 

Asn Gln Glu Tyr Leu Thr Gln Val Asp Val Ile Ala Gln Lys Cys Asp 
210 215 220 

Ala Glu Leu Lys Tyr His Gln Ser Leu Leu Pro Gln Tyr Glu Thr Pro 
225 230 235 240 

Asn Asp Glu Ser Ala Lys Lys Tyr Leu Trp Arg Val Leu Val Thr Gln 
245 250 255 

Leu Lys Lys Leu Glu Leu Asn Tyr Asp Val Tyr Leu Glu Arg Leu Lys 
260 265 270 

Tyr Glu Tyr Lys Val Ile Thr Asn Met Gly Phe Glu Asp Tyr Phe Leu 
275 280 285 

Ile Val Ser Asp Leu Ile His Tyr Ala Lys Thr Asn Asp Val Met Val 
290 295 300 

Gly Pro Gly Arg Gly Ser Ser Ala Gly Ser Leu Val Ser Tyr Leu Leu 
305 310 315 320 

Gly Ile Thr Thr Ile Asp Pro Ile Lys Phe Asn Leu Leu Phe Glu Arg 
325 330 335 

Phe Leu Asn Pro Glu Arg Val Thr Met Pro Asp Ile Asp Ile Asp Phe 
340 345 350 

Glu Asp Thr Arg Arg Glu Arg Val Ile Gln Tyr Val Gln Glu Lys Tyr 
355 360 365 

Gly Glu Leu His Val Ser Gly Ile Val Thr Phe Gly His Leu Leu Ala 
370 375 380 

Arg Ala Val Ala Arg Asp Val Gly Arg Ile Met Gly Phe Asp Glu Val 
385 390 395 400 

Thr Leu Asn Glu Ile Ser Ser Leu Ile Pro His Lys Leu Gly Ile Thr 
405 410 415 

Leu Asp Glu Ala Tyr Gln Ile Asp Asp Phe Lys Glu Phe Val His Arg 
420 425 430 

Asn His Arg His Glu Arg Trp Phe Ser Ile Cys Lys Lys Leu Glu Gly 
435 440 445 

Leu Pro Arg His Thr Ser Thr His Ala Ala Gly Ile Ile Ile Asn Asp 
450 455 460 

His Pro Leu Tyr Glu Tyr Ala Pro Leu Thr Lys Gly Asp Thr Gly Leu 
465 470 475 480 

Leu Thr Gln Trp Thr Met Thr Glu Ala Glu Arg Ile Gly Leu Leu Lys 
485 490 495 

Ile Asp Phe Leu Gly Leu Arg Asn Leu Ser Ile Ile His Gln Ile Leu 
500 505 510 

Thr Gln Val Lys Lys Asp Leu Gly Ile Asn Ile Asp Ile Glu Lys Ile 
515 520 525 

Pro Phe Asp Asp Gln Lys Val Phe Glu Leu Leu Ser Gln Gly Asp Thr 
530 535 540 

Thr Gly Ile Phe Gln Leu Glu Ser Asp Gly Val Arg Ser Val Leu Lys 
545 550 555 560 

Lys Leu Lys Pro Glu His Phe Glu Asp Ile Val Ala Val Thr Ser Leu 
565 570 575 

Tyr Arg Pro Gly Pro Met Glu Glu Ile Pro Thr Tyr Ile Thr Arg Arg 
580 585 590 

His Asp Pro Ser Lys Val Gln Tyr Leu His Pro His Leu Glu Pro Ile 
595 600 605 

Leu Lys Asn Thr Tyr Gly Val Ile Ile Tyr Gln Glu Gln Ile Met Gln 
610 615 620 

Ile Ala Ser Thr Phe Ala Asn Phe Ser Tyr Gly Glu Ala Asp Ile Leu 
625 630 635 640 

Arg Arg Ala Met Ser Lys Lys Asn Arg Ala Val Leu Glu Ser Glu Arg 
645 650 655 

Gln His Phe Ile Glu Gly Ala Lys Gln Asn Gly Tyr His Glu Asp Ile 
660 665 670 

Ser Lys Gln Ile Phe Asp Leu Ile Leu Lys Phe Ala Asp Tyr Gly Phe 
675 680 685 

Pro Arg Ala His Ala Val Ser Tyr Ser Lys Ile Ala Tyr Ile Met Ser 
690 695 700 

Phe Leu Lys Val His Tyr Pro Asn Tyr Phe Tyr Ala Asn Ile Leu Ser 
705 710 715 720 

Asn Val Ile Gly Ser Glu Lys Lys Thr Ala Gln Met Ile Glu Glu Ala 
725 730 735 

Lys Lys Gln Gly Ile Thr Ile Leu Pro Pro Asn Ile Asn Glu Ser His 
740 745 750 

Trp Phe Tyr Lys Pro Ser Gln Glu Gly Ile Tyr Leu Ser Ile Gly Thr 
755 760 765 

Ile Lys Gly Val Gly Tyr Gln Ser Val Lys Val Ile Val Asp Glu Arg 
770 775 780 

Tyr Gln Asn Gly Lys Phe Lys Asp Phe Phe Asp Phe Ala Arg Arg Ile 
785 790 795 800 

Pro Lys Arg Val Lys Thr Arg Lys Leu Leu Glu Ala Leu Ile Leu Val 
805 810 815 

Gly Ala Phe Asp Ala Phe Gly Lys Thr Arg Ser Thr Leu Leu Gln Ala 
820 825 830 

Ile Asp Gln Val Leu Asp Gly Asp Leu Asn Ile Glu Gln Asp Gly Phe 
835 840 845 

Leu Phe Asp Ile Leu Thr Pro Lys Gln Met Tyr Glu Asp Lys Glu Glu 
850 855 860 

Leu Pro Asp Ala Leu Ile Ser Gln Tyr Glu Lys Glu Tyr Leu Gly Phe 
865 870 875 880 

Tyr Val Ser Gln His Pro Val Asp Lys Lys Phe Val Ala Lys Gln Tyr 
885 890 895 

Leu Thr Ile Phe Lys Leu Ser Asn Ala Gln Asn Tyr Lys Pro Ile Leu 
900 905 910 

Val Gln Phe Asp Lys Val Lys Gln Ile Arg Thr Lys Asn Gly Gln Asn 
915 920 925 

Met Ala Phe Val Thr Leu Asn Asp Gly Ile Glu Thr Leu Asp Gly Val 
930 935 940 

Ile Phe Pro Asn Gln Phe Lys Lys Tyr Glu Glu Leu Leu Ser His Asn 
945 950 955 960 

Asp Leu Phe Ile Val Ser Gly Lys Phe Asp His Arg Lys Gln Gln Arg 
965 970 975 

Gln Leu Ile Ile Asn Glu Ile Gln Thr Leu Ala Thr Phe Glu Glu Gln 
980 985 990 

Lys Leu Ala Phe Ala Lys Gln Ile Ile Ile Arg Asn Lys Ser Gln Ile 
995 1000 1005 

Asp Met Phe Glu Glu Met Ile Lys Ala Thr Lys Glu Asn Ala Asn Asp 
1010 1015 1020 

Val Val Leu Ser Phe Tyr Asp Glu Thr Ile Lys Gln Met Thr Thr Leu 
1025 1030 1035 1040 

Gly Tyr Ile Asn Gln Lys Asp Ser Met Phe Asn Asn Phe Ile Gln Ser 
1045 1050 1055 

Phe Asn Pro Ser Asp Ile Arg Leu Ile 
1060 1065 
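
SEQ ID NO: 2 has 1065 residues, one third of the 3195 nucleotides of SEQ ID NO: 1, which suggests that it is the conceptual translation of that sequence. The sketch below is an illustration only and not part of the listing. It translates a nucleotide string into three-letter residue codes so the two entries can be compared. It assumes the standard codon assignments and reading frame 1, and the helper name `translate` is hypothetical.

```python
from itertools import product

BASES = "tcag"
# Standard genetic code in t, c, a, g order; "*" marks a stop codon.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "Ter",
}


def translate(dna):
    """Translate DNA (whitespace ignored) into three-letter residue codes."""
    seq = "".join(dna.split()).lower()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return [THREE_LETTER[CODON_TABLE[codon]] for codon in codons]


# First 60 nucleotides of SEQ ID NO: 1 as given in the listing.
FIRST_60 = ("atggtggcat atttaaatat tcatacggct "
            "tatgatttgt taaattcaag cttaaaaata")
print(" ".join(translate(FIRST_60)))
```

The twenty residues printed for this fragment can be compared with the first twenty positions of SEQ ID NO: 2. Any position where they differ points to a transcription error in one of the two entries.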
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<210> 3 
<211> 1698 
<212> DNA 

<213> Staphylococcus aureus 
<400> 3 

ttgaattatc aagccttata tcgtatgtac agaccccaaa gtttcgagga tgtcgtcgga 60 
caagaacatg tcacgaagac attgcgcaat gcgatttcga aagaaaaaca gtcgcatgca 120 
tatattttta gtggtccgag aggtacgggg aaaacgagta ttgccaaagt gtttgctaaa 180 
gcaatcaact gtttaaatag cactgatgga gaaccttgta atgaatgtca tatttgtaaa 240 
ggcattacgc aggggactaa ttcagatgtg atagaaattg atgctgctag taataatggc 300 
gttgatgaaa taagaaatat tagagacaaa gttaaatatg caccaagtga atcgaaatat 360 
aaagtttata ttatagatga ggtgcacatg ctaacaacag gtgcttttaa tgccctttta 420 
aagacgttag aagaacctcc agcacacgct atttttatat tggcaacgac agaaccacat 480 
aaaatccctc caacaatcat ttctagggca caacgttttg attttaaagc aattagccta 540 
gatcaaattg ttgaacgttt aaaatttgta gcagatgcac aacaaattga atgtgaagat 600 
gaagccttgg catttatcgc taaagcgtct gaagggggta tgcgtgatgc attaagtatt 660 
atggatcagg ctattgcttt cggcgatggc acattgacat tacaagatgc cctaaatgtt 720 
acgggtagcg ttcatgatga agcgttggat cacttgtttg atgatattgt acaaggtgac 780 
gtacaagcat cttttaaaaa ataccatcag tttataacag aaggtaaaga agtgaatcgc 840 
ctaataaatg atatgattta ttttgtcaga gatacgatta tgaataaaac atctgagaaa 900 
gatactgagt atcgagcact gatgaactta gaattagata tgttatatca aatgattgat 960 
cttattaatg atacattagt gtcgattcgt tttagtgtga atcaaaacgt tcattttgaa 1020 
gtattgttag taaaattagc tgagcagatt aagggtcaac cacaagtgat tgcgaatgta 1080 
gctgaaccag cacaaattgc ttcatcgcca aacacagatg tattgttgca acgtatggaa 1140 
cagttagagc aagaactaaa aacactaaaa gcacaaggag tgagtgttgc tcctactcaa 1200 
aaatcttcga aaaagcctgc gagaggtata caaaaatcta aaaatgcatt ttcaatgcaa 1260 
caaattgcaa aagtgctaga taaagcgaat aaggcagata tcaaattgtt gaaagatcat 1320 
tggcaagaag tgattgacca tgcccaaaac aatgataaaa aatcactcgt tagtttattg 13 80 
caaaattcgg aacctgtggc ggcaagtgaa gatcacgtcc ttgtgaaatt tgaggaagag 1440 
atccattgtg aaatcgtcaa taaagacgac gagaaacgta gtagtataga aagtgttgta 1500 
tgtaatatcg ttaataaaaa cgttaaagtt gttggtgtac catcagatca atggcaaaga 1560 
gttcgaacgg agtatttaca aaatcgtaaa aacgaaggcg atgatatgcc aaagcaacaa 1620 
gcacaacaaa cagatattgc tcaaaaagca aaagatcttt tcggtgaaga aactgtacat 1680 
gtgatagatg aagagtga 1698 
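
Each nucleotide line of the listing ends with the running count of bases up to and including that line, so a transcription of the listing can be checked mechanically. The following sketch is an illustration only. It assumes lines of lower-case bases in space-separated groups followed by one integer, and the function name `check_running_counts` is hypothetical.

```python
import re

# Bases in space-separated groups, then the running count at end of line.
LINE_RE = re.compile(r"^([acgt ]+?)\s+(\d+)$")


def check_running_counts(lines):
    """Return (line_index, counted, stated) for every line that disagrees.

    A line that cannot be parsed at all is reported as (line_index, None, None).
    """
    problems = []
    total = 0
    for index, raw in enumerate(lines):
        match = LINE_RE.match(raw.strip())
        if not match:
            problems.append((index, None, None))
            continue
        total += len(match.group(1).replace(" ", ""))
        stated = int(match.group(2))
        if stated != total:
            problems.append((index, total, stated))
            total = stated  # resynchronize so later lines are judged fairly
    return problems


# First two lines of SEQ ID NO: 3.
lines = [
    "ttgaattatc aagccttata tcgtatgtac agaccccaaa gtttcgagga tgtcgtcgga 60",
    "caagaacatg tcacgaagac attgcgcaat gcgatttcga aagaaaaaca gtcgcatgca 120",
]
print(check_running_counts(lines))
```

An empty result means every stated count matches the number of bases read so far. A dropped or doubled base shows up as a mismatch on the first affected line.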



<210> 4 
<211> 566 
<212> PRT 

<213> Staphylococcus aureus 
<400> 4 

Leu Asn Tyr Gln Ala Leu Tyr Arg Met Tyr Arg Pro Gln Ser Phe Glu 
1 5 10 15 

Asp Val Val Gly Gln Glu His Val Thr Lys Thr Leu Arg Asn Ala Ile 
20 25 30 

Ser Lys Glu Lys Gln Ser His Ala Tyr Ile Phe Ser Gly Pro Arg Gly 
35 40 45 

Thr Gly Lys Thr Ser Ile Ala Lys Val Phe Ala Lys Ala Ile Asn Cys 
50 55 60 

Leu Asn Ser Thr Asp Gly Glu Pro Cys Asn Glu Cys His Ile Cys Lys 
65 70 75 80 

Gly Ile Thr Gln Gly Thr Asn Ser Asp Val Ile Glu Ile Asp Ala Ala 
85 90 95 

Ser Asn Asn Gly Val Asp Glu Ile Arg Asn Ile Arg Asp Lys Val Lys 
100 105 110 

Tyr Ala Pro Ser Glu Ser Lys Tyr Lys Val Tyr Ile Ile Asp Glu Val 
115 120 125 

His Met Leu Thr Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu 
130 135 140 

Glu Pro Pro Ala His Ala Ile Phe Ile Leu Ala Thr Thr Glu Pro His 
145 150 155 160 

Lys Ile Pro Pro Thr Ile Ile Ser Arg Ala Gln Arg Phe Asp Phe Lys 
165 170 175 

Ala Ile Ser Leu Asp Gln Ile Val Glu Arg Leu Lys Phe Val Ala Asp 
180 185 190 

Ala Gln Gln Ile Glu Cys Glu Asp Glu Ala Leu Ala Phe Ile Ala Lys 
195 200 205 

Ala Ser Glu Gly Gly Met Arg Asp Ala Leu Ser Ile Met Asp Gln Ala 
210 215 220 

Ile Ala Phe Gly Asp Gly Thr Leu Thr Leu Gln Asp Ala Leu Asn Val 
225 230 235 240 

Thr Gly Ser Val His Asp Glu Ala Leu Asp His Leu Phe Asp Asp Ile 
245 250 255 

Val Gln Gly Asp Val Gln Ala Ser Phe Lys Lys Tyr His Gln Phe Ile 
260 265 270 

Thr Glu Gly Lys Glu Val Asn Arg Leu Ile Asn Asp Met Ile Tyr Phe 
275 280 285 

Val Arg Asp Thr Ile Met Asn Lys Thr Ser Glu Lys Asp Thr Glu Tyr 
290 295 300 

Arg Ala Leu Met Asn Leu Glu Leu Asp Met Leu Tyr Gln Met Ile Asp 
305 310 315 320 

Leu Ile Asn Asp Thr Leu Val Ser Ile Arg Phe Ser Val Asn Gln Asn 
325 330 335 

Val His Phe Glu Val Leu Leu Val Lys Leu Ala Glu Gln Ile Lys Gly 
340 345 350 

Gln Pro Gln Val Ile Ala Asn Val Ala Glu Pro Ala Gln Ile Ala Ser 
355 360 365 

Ser Pro Asn Thr Asp Val Leu Leu Gln Arg Met Glu Gln Leu Glu Gln 
370 375 380 

Glu Leu Lys Thr Leu Lys Ala Gln Gly Val Ser Val Ala Pro Thr Gln 
385 390 395 400 

Lys Ser Ser Lys Lys Pro Ala Arg Gly Ile Gln Lys Ser Lys Asn Ala 
405 410 415 

Phe Ser Met Gln Gln Ile Ala Lys Val Leu Asp Lys Ala Asn Lys Ala 
420 425 430 

Asp Ile Lys Leu Leu Lys Asp His Trp Gln Glu Val Ile Asp His Ala 
435 440 445 

Gln Asn Asn Asp Lys Lys Ser Leu Val Ser Leu Leu Gln Asn Ser Glu 
450 455 460 

Pro Val Ala Ala Ser Glu Asp His Val Leu Val Lys Phe Glu Glu Glu 
465 470 475 480 

Ile His Cys Glu Ile Val Asn Lys Asp Asp Glu Lys Arg Ser Ser Ile 
485 490 495 

Glu Ser Val Val Cys Asn Ile Val Asn Lys Asn Val Lys Val Val Gly 
500 505 510 

Val Pro Ser Asp Gln Trp Gln Arg Val Arg Thr Glu Tyr Leu Gln Asn 
515 520 525 

Arg Lys Asn Glu Gly Asp Asp Met Pro Lys Gln Gln Ala Gln Gln Thr 
530 535 540 

Asp Ile Ala Gln Lys Ala Lys Asp Leu Phe Gly Glu Glu Thr Val His 
545 550 555 560 

Val Ile Asp Glu Glu Glx 
565 



<210> 5 
<211> 1398 
<212> DNA 

<213> Staphylococcus aureus 
<400> 5 

atggatagaa tgtatgagca aaatcaaatg ccgcataaca atgaagctga acagtctgtc 60 
ttaggttcaa ttattataga tccagaattg attaatacta ctcaggaagt tttgcttcct 120 
gagtcgtttt ataggggtgc ccatcaacat attttccgtg caatgatgca cttaaatgaa 180 
gataataaag aaattgatgt tgtaacattg atggatcaat tatcgacgga aggtacgttg 240 
aatgaagcgg gtggcccgca atatcttgca gagttatcta caaatgtacc aacgacgcga 300 
aatgttcagt attatactga tatcgtttct aagcatgcat taaaacgtag attgattcaa 360 
actgcagata gtattgccaa tgatggatat aatgatgaac ttgaactaga tgcgatttta 420 
agtgatgcag aacgtcgaat tttagagcta tcatcttctc gtgaaagcga tggctttaaa 480 
gacattcgag acgtcttagg acaagtgtat gaaacagctg aagagcttga tcaaaatagt 540 
ggtcaaacac caggtatacc tacaggatat cgagatttag accaaatgac agcagggttc 600 
aaccgaaatg atttaattat ccttgcagcg cgtccatctg taggtaagac tgcgttcgca 660 
cttaatattg cacaaaaagt tgcaacgcat gaagatatgt atacagttgg tattttctcg 720 
ctagagatgg gtgctgatca gttagccaca cgtatgattt gtagttctgg aaatgttgac 780 
tcaaaccgct taagaacggg tactatgact gaggaagatt ggagtcgttt tactatagcg 840 
gtaggtaaat tatcacgtac gaagattttt attgatgata caccgggtat tcgaattaat 900 
gatttacgtt ctaaatgtcg tcgattaaag caagaacatg gcttagacat gattgtgatt 960 
gactacttac agttgattca aggtagtggt tcacgtgcgt ccgataacag acaacaggaa 1020 
gtttctgaaa tctctcgtac attaaaagca ttagcccgtg aattaaaatg tccagttatc 1080 
gcattaagtc agttatctcg tggtgttgaa caacgacaag ataaacgtcc aatgatgagt 1140 
gatattcgtg aatctggttc gattgagcaa gatgccgata tcgttgcatt cttataccgt 1200 
gatgattact ataaccgtgg cggcgatgaa gatgatgacg atgatggtgg tttcgagcca 1260 
caaacgaatg atgaaaacgg tgaaattgaa attatcattg ctaagcaacg taacggtcca 1320 
acaggcacag ttaagttaca ttttatgaaa caatataata aatttaccga tatcgattat 1380 
gcacatgcag atatgatg 1398 



<210> 6 
<211> 466 
<212> PRT 

<213> Staphylococcus aureus 
<400> 6 

Met Asp Arg Met Tyr Glu Gln Asn Gln Met Pro His Asn Asn Glu Ala 
1 5 10 15 

Glu Gln Ser Val Leu Gly Ser Ile Ile Ile Asp Pro Glu Leu Ile Asn 
20 25 30 

Thr Thr Gln Glu Val Leu Leu Pro Glu Ser Phe Tyr Arg Gly Ala His 
35 40 45 

Gln His Ile Phe Arg Ala Met Met His Leu Asn Glu Asp Asn Lys Glu 
50 55 60 

Ile Asp Val Val Thr Leu Met Asp Gln Leu Ser Thr Glu Gly Thr Leu 
65 70 75 80 

Asn Glu Ala Gly Gly Pro Gln Tyr Leu Ala Glu Leu Ser Thr Asn Val 
85 90 95 

Pro Thr Thr Arg Asn Val Gln Tyr Tyr Thr Asp Ile Val Ser Lys His 
100 105 110 

Ala Leu Lys Arg Arg Leu Ile Gln Thr Ala Asp Ser Ile Ala Asn Asp 
115 120 125 

Gly Tyr Asn Asp Glu Leu Glu Leu Asp Ala Ile Leu Ser Asp Ala Glu 
130 135 140 

Arg Arg Ile Leu Glu Leu Ser Ser Ser Arg Glu Ser Asp Gly Phe Lys 
145 150 155 160 

Asp Ile Arg Asp Val Leu Gly Gln Val Tyr Glu Thr Ala Glu Glu Leu 
165 170 175 

Asp Gln Asn Ser Gly Gln Thr Pro Gly Ile Pro Thr Gly Tyr Arg Asp 
180 185 190 

Leu Asp Gln Met Thr Ala Gly Phe Asn Arg Asn Asp Leu Ile Ile Leu 
195 200 205 

Ala Ala Arg Pro Ser Val Gly Lys Thr Ala Phe Ala Leu Asn Ile Ala 
210 215 220 

Gln Lys Val Ala Thr His Glu Asp Met Tyr Thr Val Gly Ile Phe Ser 
225 230 235 240 

Leu Glu Met Gly Ala Asp Gln Leu Ala Thr Arg Met Ile Cys Ser Ser 
245 250 255 

Gly Asn Val Asp Ser Asn Arg Leu Arg Thr Gly Thr Met Thr Glu Glu 
260 265 270 

Asp Trp Ser Arg Phe Thr Ile Ala Val Gly Lys Leu Ser Arg Thr Lys 
275 280 285 

Ile Phe Ile Asp Asp Thr Pro Gly Ile Arg Ile Asn Asp Leu Arg Ser 
290 295 300 

Lys Cys Arg Arg Leu Lys Gln Glu His Gly Leu Asp Met Ile Val Ile 
305 310 315 320 

Asp Tyr Leu Gln Leu Ile Gln Gly Ser Gly Ser Arg Ala Ser Asp Asn 
325 330 335 

Arg Gln Gln Glu Val Ser Glu Ile Ser Arg Thr Leu Lys Ala Leu Ala 
340 345 350 

Arg Glu Leu Lys Cys Pro Val Ile Ala Leu Ser Gln Leu Ser Arg Gly 
355 360 365 

Val Glu Gln Arg Gln Asp Lys Arg Pro Met Met Ser Asp Ile Arg Glu 
370 375 380 

Ser Gly Ser Ile Glu Gln Asp Ala Asp Ile Val Ala Phe Leu Tyr Arg 
385 390 395 400 

Asp Asp Tyr Tyr Asn Arg Gly Gly Asp Glu Asp Asp Asp Asp Asp Gly 
405 410 415 

Gly Phe Glu Pro Gln Thr Asn Asp Glu Asn Gly Glu Ile Glu Ile Ile 
420 425 430 

Ile Ala Lys Gln Arg Asn Gly Pro Thr Gly Thr Val Lys Leu His Phe 
435 440 445 

Met Lys Gln Tyr Asn Lys Phe Thr Asp Ile Asp Tyr Ala His Ala Asp 
450 455 460 

Met Met 
465 



<210> 7 

<211> 4308 

<212> DNA 

<213> Staphylococcus aureus 

<400> 7 
atgacagagc aacaaaaatt taaagtgctt gctgatcaaa ttaaaatttc aaatcaatta 60 
gatgctgaaa ttttaaattc aggtgaactg acacgtatag atgtttctaa caaaaacaga 120 
acatgggaat ttcatattac attaccacaa ttcttagctc atgaagatta tttattattt 180 
ataaatgcaa tagagcaaga gtttaaagat atcgccaacg ttacatgtcg ttttacggta 240 
acaaatggca cgaatcaaga tgaacatgca attaaatact ttgggcactg tattgaccaa 300 
acagctttat ctccaaaagt taaaggtcaa ttgaaacaga aaaagcttat tatgtctgga 360 
aaagtattaa aagtaatggt atcaaatgac attgaacgta atcattttga taaggcatgt 420 
aatggaagtc ttatcaaagc gtttagaaat tgtggttttg atatcgataa aatcatattc 480 
gaaacaaatg ataatgatca agaacaaaac ttagcttctt tagaagcaca tattcaagaa 540 
gaagacgaac aaagtgcacg attggcaaca gagaaacttg aaaaaatgaa agctgaaaaa 600 
gcgaaacaac aagataacaa cgaaagtgct gtcgataagt gtcaaattgg taagccgatt 660 
caaattgaaa atattaaacc aattgaatct attattgagg aagagtttaa agttgcaata 720 
gagggtgtca tttttgatat aaacttaaaa gaacttaaaa gtggtcgcca tatcgtagaa 780 
attaaagtga ctgactatac ggactcttta gttttaaaaa tgtttactcg taaaaacaaa 840 
gatgatttag aacattttaa agcgctaagt gttggtaaat gggttagggc tcaaggtcgt 900 
attgaagaag atacatttat tagagattta gttatgatga tgtctgatat tgaagagatt 960 
aaaaaagcga caaaaaaaga taaggctgaa gaaaagcgtg tagaattcca cttgcatact 1020 
gcaatgagcc aaatggatgg tatacccaat attggtgcgt atgttaaaca ggcagcagac 1080 
tggggacatc cagccattgc ggttacagac cataatgttg tgcaagcatt tccagatgct 1140 
cacgcagcag cggaaaaaca tggcattaaa atgatatacg gtatggaagg tatgttagtt 1200 
gatgatggtg ttccgattgc atacaaacca caagatgtcg tattaaaaga tgctacttat 1260 
gttgtgttcg acgttgagac aactggttta tcaaatcagt atgataaaat catcgagctt 1320 
gcagctgtga aagttcataa cggtgaaatc atcgataagt ttgaaaggtt tagtaatccg 1380 
catgaacgat tatcggaaac gattatcaat ttgacgcata ttactgatga tatgttagta 1440 
gatgcccctg agattgaaga agtacttaca gagtttaaag aatgggttgg cgatgcgata 1500 
ttcgtagcgc ataatgcttc gtttgatatg ggcttcatcg atacgggata tgaacgtctt 1560 
gggtttggac catcaacgaa tggtgttatc gatactttag aattatctcg tacgattaat 1620 
actgaatatg gtaaacatgg tttgaatttc ttggctaaaa aatatggcgt agaattaacg 1680 
caacatcacc gtgccattta tgatacagaa gcaacagctt acattttcat aaaaatggtt 1740 
caacaaatga aagaattagg cgtattaaat cataacgaaa tcaacaaaaa actcagtaat 1800 
gaagatgcat ataaacgtgc aagacctagt catgtcacat taattgtaca aaaccaacaa 1860 
ggtcttaaaa atctatttaa aattgtaagt gcatcattgg tgaagtattt ctaccgtaca 1920 
cctcgaattc cacgttcatt gttagatgaa tatcgtgagg gattattggt aggtacagcg 1980 
tgtgatgaag gtgaattatt tacggcagtt atgcagaagg accagagtca agttgaaaaa 2040 
attgccaaat attatgattt tattgaaatt caaccaccgg cactttatca agatttaatt 2100 
gatagagagc ttattagaga tactgaaaca ttacatgaaa tttatcaacg tttaatacat 2160 
gcaggtgaca cagcgggtat acctgttatt gcgacaggaa atgcacacta tttgtttgaa 2220 
catgatggta tcgcacgtaa aattttaata gcatcacaac ccggcaatcc acttaatcgc 2280 
tcaactttac cggaagcaca ttttagaact acagatgaaa tgttaaacga gtttcatttt 2340 
ttaggtgaag aaaaagcgca tgaaattgtt gtgaaaaata caaacgaatt agcagatcga 2400 
attgaacgtg ttgttcctat taaagatgaa ttatacacac cgcgtatgga aggtgctaac 2460 
gaagaaatta gagaactaag ttatgcaaat gcgcgtaaac tgtatggtga agacctgcct 2520 
caaatcgtaa ttgatcgatt agaaaaagaa ttaaaaagta ttatcggtaa tggatttgcg 2580 
gtaatttact taatttcgca acgtttagtt aaaaaatcat tagatgatgg atacttagtt 2640 
ggttcccgtg gttcagtagg ttctagtttt gtagcgacaa tgactgagat tactgaagta 2700 
aacccgttac cgccacacta tatttgtccg aactgtaaaa cgagtgaatt tttcaatgat 2760 
ggttcagtag gatcaggatt tgatttacct gataagacgt gtgaaacttg tggagcgcca 2820 
cttattaaag aaggacaaga tattccgttt gaaacatttt taggatttaa gggagataaa 2880 
gttcctgata tcgacttaaa ctttagtggt gaatatcaac cgaatgccca taactacaca 2940 
aaagtattat ttggtgagga taaagtattc cgtgcaggta caattggtac tgttgctgaa 3000 
aagactgctt ttggttatgt taaaggttat ttgaatgatc aaggtatcca caaaagaggt 3060 
gctgaaatag atcgactcgt taaaggatgt acaggtgtta aacgtacaac tggacagcat 3120 
ccagggggta ttattgtagt acctgattac atggatattt atgattttac gccgatacaa 3180 
tatcctgccg atgatcaaaa ttcagcatgg atgacgacac attttgattt ccattctatt 3240 
catgataatg tattaaaact tgatatactt ggacacgatg atccaacaat gattcgtatg 3300 
cttcaagatt tatcaggaat tgatccaaaa acaatacctg tagatgataa agaagttatg 3360 
cagatattta gtacacctga aagtttgggt gttactgaag atgaaatttt atgtaaaaca 3420 
ggtacatttg gggtaccaga attcggtaca ggattcgtgc gtcaaatgtt agaagataca 3480 
aagccaacaa cattttctga attagttcaa atctcaggat tatctcatgg tacagatgtg 3540 
tggttaggca atgctcaaga attaattaaa accggtatat gtgatttatc aagtgtaatt 3600 
ggttgtcgtg atgatatcat ggtttattta atgtatgctg gtttagaacc atcaatggct 3660 
tttaaaataa tggagtcagt acgtaaaggt aaaggtttaa ctgaagaaat gattgaaacg 3720 
atgaaagaaa atgaagtgcc agattggtat ttagattcat gtcttaaaat taagtacatg 3780 
ttccctaaag cccatgcagc agcatacgtt ttaatggcag tacgtatcgc atatttcaaa 3840 
gtacatcatc cactttatta ctatgcatct tactttacaa ttcgtgcgtc agactttgat 3900 
ttaatcacga tgattaaaga taaaacaagc attcgaaata ctgtaaaaga catgtattct 3960 
cgctatatgg atctaggtaa aaaagaaaaa gacgtattaa cagtcttgga aattatgaat 4020 
gaaatggcgc atcgaggtta tcgaatgcaa ccgattagtt tagaaaagag tcaggcgttc 4080 
gaatttatca ttgaaggcga tacacttatt ccgccgttca tatcagtgcc tgggcttggc 4140 
gaaaacgttg cgaaacgaat tgttgaagct cgtgacgatg gcccattttt atcaaaagaa 4200 
gatttaaaca aaaaagctgg attatctcag aaaattattg agtatttaga tgagttaggc 4260 
tcattaccga atttaccaga taaagctcaa ctttcgatat ttgatatg 4308 



<210> 8 
<211> 1435 
<212> PRT 

<213> Staphylococcus aureus 
<400> 8 

Met Thr Glu Gln Gln Lys Phe Lys Val Leu Ala Asp Gln Ile Lys Ile
1 5 10 15

Ser Asn Gln Leu Asp Ala Glu Ile Leu Asn Ser Gly Glu Leu Thr Arg
20 25 30

Ile Asp Val Ser Asn Lys Asn Arg Thr Trp Glu Phe His Ile Thr Leu
35 40 45

Pro Gln Phe Leu Ala His Glu Asp Tyr Leu Leu Phe Ile Asn Ala Ile
50 55 60

Glu Gln Glu Phe Lys Asp Ile Ala Asn Val Thr Cys Arg Phe Thr Val
65 70 75 80

Thr Asn Gly Thr Asn Gln Asp Glu His Ala Ile Lys Tyr Phe Gly His
85 90 95

Cys Ile Asp Gln Thr Ala Leu Ser Pro Lys Val Lys Gly Gln Leu Lys
100 105 110

Gln Lys Lys Leu Ile Met Ser Gly Lys Val Leu Lys Val Met Val Ser
115 120 125

Asn Asp Ile Glu Arg Asn His Phe Asp Lys Ala Cys Asn Gly Ser Leu
130 135 140

Ile Lys Ala Phe Arg Asn Cys Gly Phe Asp Ile Asp Lys Ile Ile Phe
145 150 155 160

Glu Thr Asn Asp Asn Asp Gln Glu Gln Asn Leu Ala Ser Leu Glu Ala
165 170 175

His Ile Gln Glu Glu Asp Glu Gln Ser Ala Arg Leu Ala Thr Glu Lys
180 185 190

Leu Glu Lys Met Lys Ala Glu Lys Ala Lys Gln Gln Asp Asn Lys Gln
195 200 205

Ser Ala Val Asp Lys Cys Gln Ile Gly Lys Pro Ile Gln Ile Glu Asn
210 215 220

Ile Lys Pro Ile Glu Ser Ile Ile Glu Glu Glu Phe Lys Val Ala Ile
225 230 235 240

Glu Gly Val Ile Phe Asp Ile Asn Leu Lys Glu Leu Lys Ser Gly Arg
245 250 255

His Ile Val Glu Ile Lys Val Thr Asp Tyr Thr Asp Ser Leu Val Leu
260 265 270

Lys Met Phe Thr Arg Lys Asn Lys Asp Asp Leu Glu His Phe Lys Ala
275 280 285

Leu Ser Val Gly Lys Trp Val Arg Ala Gln Gly Arg Ile Glu Glu Asp
290 295 300

Thr Phe Ile Arg Asp Leu Val Met Met Met Ser Asp Ile Glu Glu Ile
305 310 315 320

Lys Lys Ala Thr Lys Lys Asp Lys Ala Glu Glu Lys Arg Val Glu Phe
325 330 335

His Leu His Thr Ala Met Ser Gln Met Asp Gly Ile Pro Asn Ile Gly
340 345 350

Ala Tyr Val Lys Gln Ala Ala Asp Trp Gly His Pro Ala Ile Ala Val
355 360 365

Thr Asp His Asn Val Val Gln Ala Phe Pro Asp Ala His Ala Ala Ala
370 375 380

Glu Lys His Gly Ile Lys Met Ile Tyr Gly Met Glu Gly Met Leu Val
385 390 395 400

Asp Asp Gly Val Pro Ile Ala Tyr Lys Pro Gln Asp Val Val Leu Lys
405 410 415

Asp Ala Thr Tyr Val Val Phe Asp Val Glu Thr Thr Gly Leu Ser Asn
420 425 430

Gln Tyr Asp Lys Ile Ile Glu Leu Ala Ala Val Lys Val His Asn Gly
435 440 445

Glu Ile Ile Asp Lys Phe Glu Arg Phe Ser Asn Pro His Glu Arg Leu
450 455 460

Ser Glu Thr Ile Ile Asn Leu Thr His Ile Thr Asp Asp Met Leu Val
465 470 475 480

Asp Ala Pro Glu Ile Glu Glu Val Leu Thr Glu Phe Lys Glu Trp Val
485 490 495

Gly Asp Ala Ile Phe Val Ala His Asn Ala Ser Phe Asp Met Gly Phe
500 505 510

Ile Asp Thr Gly Tyr Glu Arg Leu Gly Phe Gly Pro Ser Thr Asn Gly
515 520 525

Val Ile Asp Thr Leu Glu Leu Ser Arg Thr Ile Asn Thr Glu Tyr Gly
530 535 540

Lys His Gly Leu Asn Phe Leu Ala Lys Lys Tyr Gly Val Glu Leu Thr
545 550 555 560

Gln His His Arg Ala Ile Tyr Asp Thr Glu Ala Thr Ala Tyr Ile Phe
565 570 575

Ile Lys Met Val Gln Gln Met Lys Glu Leu Gly Val Leu Asn His Asn
580 585 590

Glu Ile Asn Lys Lys Leu Ser Asn Glu Asp Ala Tyr Lys Arg Ala Arg
595 600 605

Pro Ser His Val Thr Leu Ile Val Gln Asn Gln Gln Gly Leu Lys Asn
610 615 620

Leu Phe Lys Ile Val Ser Ala Ser Leu Val Lys Tyr Phe Tyr Arg Thr
625 630 635 640

Pro Arg Ile Pro Arg Ser Leu Leu Asp Glu Tyr Arg Glu Gly Leu Leu
645 650 655

Val Gly Thr Ala Cys Asp Glu Gly Glu Leu Phe Thr Ala Val Met Gln
660 665 670

Lys Asp Gln Ser Gln Val Glu Lys Ile Ala Lys Tyr Tyr Asp Phe Ile
675 680 685

Glu Ile Gln Pro Pro Ala Leu Tyr Gln Asp Leu Ile Asp Arg Glu Leu
690 695 700

Ile Arg Asp Thr Glu Thr Leu His Glu Ile Tyr Gln Arg Leu Ile His
705 710 715 720

Ala Gly Asp Thr Ala Gly Ile Pro Val Ile Ala Thr Gly Asn Ala His
725 730 735

Tyr Leu Phe Glu His Asp Gly Ile Ala Arg Lys Ile Leu Ile Ala Ser
740 745 750

Gln Pro Gly Asn Pro Leu Asn Arg Ser Thr Leu Pro Glu Ala His Phe
755 760 765

Arg Thr Thr Asp Glu Met Leu Asn Glu Phe His Phe Leu Gly Glu Glu
770 775 780

Lys Ala His Glu Ile Val Val Lys Asn Thr Asn Glu Leu Ala Asp Arg
785 790 795 800

Ile Glu Arg Val Val Pro Ile Lys Asp Glu Leu Tyr Thr Pro Arg Met
805 810 815

Glu Gly Ala Asn Glu Glu Ile Arg Glu Leu Ser Tyr Ala Asn Ala Arg
820 825 830

Lys Leu Tyr Gly Glu Asp Leu Pro Gln Ile Val Ile Asp Arg Leu Glu
835 840 845

Lys Glu Leu Lys Ser Ile Ile Gly Asn Gly Phe Ala Val Ile Tyr Leu
850 855 860

Ile Ser Gln Arg Leu Val Lys Lys Ser Leu Asp Asp Gly Tyr Leu Val
865 870 875 880

Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr Met Thr Glu
885 890 895

Ile Thr Glu Val Asn Pro Leu Pro Pro His Tyr Ile Cys Pro Asn Cys
900 905 910

Lys Thr Ser Glu Phe Phe Asn Asp Gly Ser Val Gly Ser Gly Phe Asp
915 920 925

Leu Pro Asp Lys Thr Cys Glu Thr Cys Gly Ala Pro Leu Ile Lys Glu
930 935 940

Gly Gln Asp Ile Pro Phe Glu Lys Phe Leu Gly Phe Lys Gly Asp Lys
945 950 955 960

Val Pro Asp Ile Asp Leu Asn Phe Ser Gly Glu Tyr Gln Pro Asn Ala
965 970 975

His Asn Tyr Thr Lys Val Leu Phe Gly Glu Asp Lys Val Phe Arg Ala
980 985 990

Gly Thr Ile Gly Thr Val Ala Glu Lys Thr Ala Phe Gly Tyr Val Lys
995 1000 1005

Gly Tyr Leu Asn Asp Gln Gly Ile His Lys Arg Gly Ala Glu Ile Asp
1010 1015 1020

Arg Leu Val Lys Gly Cys Thr Gly Val Lys Ala Thr Thr Gly Gln His
1025 1030 1035 1040

Pro Gly Gly Ile Ile Val Val Pro Asp Tyr Met Asp Ile Tyr Asp Phe
1045 1050 1055

Thr Pro Ile Gln Tyr Pro Ala Asp Asp Gln Asn Ser Ala Trp Met Thr
1060 1065 1070

Thr His Phe Asp Phe His Ser Ile His Asp Asn Val Leu Lys Leu Asp
1075 1080 1085

Ile Leu Gly His Asp Asp Pro Thr Met Ile Arg Met Leu Gln Asp Leu
1090 1095 1100

Ser Gly Ile Asp Pro Lys Thr Ile Pro Val Asp Asp Lys Glu Val Met
1105 1110 1115 1120

Gln Ile Phe Ser Thr Pro Glu Ser Leu Gly Val Thr Glu Asp Glu Ile
1125 1130 1135

Leu Cys Lys Thr Gly Thr Phe Gly Val Pro Asn Ser Asp Arg Ile Arg
1140 1145 1150

Arg Gln Met Leu Glu Asp Thr Lys Pro Thr Thr Phe Ser Glu Leu Val
1155 1160 1165

Gln Ile Ser Gly Leu Ser His Gly Thr Asp Val Trp Leu Gly Asn Ala
1170 1175 1180

Gln Glu Leu Ile Lys Thr Gly Ile Cys Asp Leu Ser Ser Val Ile Gly
1185 1190 1195 1200

Cys Arg Asp Asp Ile Met Val Tyr Leu Met Tyr Ala Gly Leu Glu Pro
1205 1210 1215

Ser Met Ala Phe Lys Ile Met Glu Ser Val Arg Lys Gly Lys Gly Leu
1220 1225 1230

Thr Glu Glu Met Ile Glu Thr Met Lys Glu Asn Glu Val Pro Asp Trp
1235 1240 1245

Tyr Leu Asp Ser Cys Leu Lys Ile Lys Tyr Ile Phe Pro Lys Ala His
1250 1255 1260

Ala Ala Ala Tyr Val Leu Met Ala Val Arg Ile Ala Tyr Phe Lys Val
1265 1270 1275 1280

His His Pro Leu Tyr Tyr Tyr Ala Ser Tyr Phe Thr Ile Arg Ala Ser
1285 1290 1295

Asp Phe Asp Leu Ile Thr Met Ile Lys Asp Lys Thr Ser Ile Arg Asn
1300 1305 1310

Thr Val Lys Asp Met Tyr Ser Arg Tyr Met Asp Leu Gly Lys Lys Glu
1315 1320 1325

Lys Asp Val Leu Thr Val Leu Glu Ile Met Asn Glu Met Ala His Arg
1330 1335 1340

Gly Tyr Arg Met Gln Pro Ile Ser Leu Glu Lys Ser Gln Ala Phe Glu
1345 1350 1355 1360

Phe Ile Ile Glu Gly Asp Thr Leu Ile Pro Pro Phe Ile Ser Val Pro
1365 1370 1375

Gly Leu Gly Glu Asn Val Ala Lys Arg Ile Val Glu Ala Arg Asp Asp
1380 1385 1390

Gly Pro Phe Leu Ser Lys Glu Asp Leu Asn Lys Lys Ala Gly Leu Tyr
1395 1400 1405

Gln Lys Ile Ile Glu Tyr Leu Asp Glu Leu Gly Ser Leu Pro Asn Leu
1410 1415 1420

Pro Asp Lys Ala Gln Leu Ser Ile Phe Asp Met
1425 1430 1435



<210> 9 
<211> 1134 
<212> DNA 

<213> Staphylococcus aureus 
<400> 9 

atgatggaat tcactattaa aagagattat tttattacac aattaaatga cacattaaaa 60 
gctatttcac caagaacaac attacctata ttaactggta tcaaaatcga tgcgaaagaa 120 
catgaagtta tattaactgg ttcagactct gaaatttcaa tagaaatcac tattcctaaa 180 
actgtagatg gcgaagatat tgtcaatatt tcagaaacag gctcagtagt acttcctgga 240 
cgattctttg ttgatattat aaaaaaatta cctggtaaag atgttaaatt atctacaaat 300 
gaacaattcc agacattaat tacatcaggt cattctgaat ttaatttgag tggcttagat 360 
ccagatcaat atcctttatt acctcaagtt tctagagatg acgcaattca attgtcggta 420 
aaagtactta aaaacgtgat tgcacaaacg aattttgcag tgtccacctc agaaacacgc 480 
ccagtactaa ctggtgtgaa ctggcttata caagaaaatg aattaatatg cacagcgact 540 
gattcacacc gcttggctgt aagaaagttg cagttagaag atgtttctga aaacaaaaat 600 
gtcatcattc caggtaaggc tttagctgaa ttaaataaaa ttatgtctga caatgaagaa 660 
gacattgata tcttctttgc ttcaaaccaa gttttattta aagttggaaa tgtgaacttt 720
atttctcgat tattagaagg acattatcct gatacaacac gtttattccc tgaaaactat 780
gaaattaaat taagtataga caatggggag ttttatcatg cgattgatcg tgcctcttta 840 
ttagcacgtg aaggtggtaa taacgttatt aaattaagta caggtgatga cgttgttgaa 900 
ttatcttcta catcaccaga aattggtact gtaaaagaag aagttgatgc aaacgatgtt 960 
gaaggtggta gcctgaaaat ttcattcaac tctaaatata tgatggatgc tttaaaagca 1020
atcgataatg atgaggttga agttgaattc ttcggtacaa tgaaaccatt tattctaaaa 1080 
ccaaaaggtg acgactcggt aacgcaatta attttaccaa tcagaactta ctaa 1134 

<210> 10 
<211> 377 
<212> PRT 

<213> Staphylococcus aureus 



<400> 10 
Met Met Glu Phe Thr Ile Lys Arg Asp Tyr Phe Ile Thr Gln Leu Asn
1 5 10 15

Asp Thr Leu Lys Ala Ile Ser Pro Arg Thr Thr Leu Pro Ile Leu Thr
20 25 30

Gly Ile Lys Ile Asp Ala Lys Glu His Glu Val Ile Leu Thr Gly Ser
35 40 45

Asp Ser Glu Ile Ser Ile Glu Ile Thr Ile Pro Lys Thr Val Asp Gly
50 55 60

Glu Asp Ile Val Asn Ile Ser Glu Thr Gly Ser Val Val Leu Pro Gly
65 70 75 80

Arg Phe Phe Val Asp Ile Ile Lys Lys Leu Pro Gly Lys Asp Val Lys
85 90 95

Leu Ser Thr Asn Glu Gln Phe Gln Thr Leu Ile Thr Ser Gly His Ser
100 105 110

Glu Phe Asn Leu Ser Gly Leu Asp Pro Asp Gln Tyr Pro Leu Leu Pro
115 120 125

Gln Val Ser Arg Asp Asp Ala Ile Gln Leu Ser Val Lys Val Leu Lys
130 135 140

Asn Val Ile Ala Gln Thr Asn Phe Ala Val Ser Thr Ser Glu Thr Arg
145 150 155 160

Pro Val Leu Thr Gly Val Asn Trp Leu Ile Gln Glu Asn Glu Leu Ile
165 170 175

Cys Thr Ala Thr Asp Ser His Arg Leu Ala Val Arg Lys Leu Gln Leu
180 185 190

Glu Asp Val Ser Glu Asn Lys Asn Val Ile Ile Pro Gly Lys Ala Leu
195 200 205

Ala Glu Leu Asn Lys Ile Met Ser Asp Asn Glu Glu Asp Ile Asp Ile
210 215 220

Phe Phe Ala Ser Asn Gln Val Leu Phe Lys Val Gly Asn Val Asn Phe
225 230 235 240

Ile Ser Arg Leu Leu Glu Gly His Tyr Pro Asp Thr Thr Arg Leu Phe
245 250 255

Pro Glu Asn Tyr Glu Ile Lys Leu Ser Ile Asp Asn Gly Glu Phe Tyr
260 265 270

His Ala Ile Asp Arg Ala Ser Leu Leu Ala Arg Glu Gly Gly Asn Asn
275 280 285

Val Ile Lys Leu Ser Thr Gly Asp Asp Val Val Glu Leu Ser Ser Thr
290 295 300

Ser Pro Glu Ile Gly Thr Val Lys Glu Glu Val Asp Ala Asn Asp Val
305 310 315 320

Glu Gly Gly Ser Leu Lys Ile Ser Phe Asn Ser Lys Tyr Met Met Asp
325 330 335

Ala Leu Lys Ala Ile Asp Asn Asp Glu Val Glu Val Glu Phe Phe Gly
340 345 350

Thr Met Lys Pro Phe Ile Leu Lys Pro Lys Gly Asp Asp Ser Val Thr
355 360 365

Gln Leu Ile Leu Pro Ile Arg Thr Tyr
370 375


<210> 11 
<211> 930 
<212> DNA 

<213> Staphylococcus aureus 
<400> 11 

atggatgaac agcaacaatt gacgaatgca tatcattcaa ataaattatc gcatgcctat 60 
ttatttgaag gtgatgatgc acaaacgatg aaacaagttg cgattaattt tgcaaagctt 120 
attttatgtc aaacagatag tcaatgtgaa acaaaggtta gtacatataa tcatccagac 180 
tttatgtata tatcaacaac tgagaatgca attaagaaag aacaagttga acaacttgtg 240 
cgtcatatga atcaacttcc tatagaaagc acaaataaag tgtacatcat cgaagacttt 300 
gaagactttg aaaagttaac tgttcaaggg gaaaacagta tcttgaaatt tcttgaagaa 360
ccaccggaca atacgattgc tattttattg tctacaaaac ctgagcaaat tttagacaca 420
atccattcaa ggtgtcagca tgtatatttc aagcctattg ataaagaaaa gtttataaat 480 
agattagttg aacaaaacat gtctaagcca gtagctgaaa tgattagtac ttatactacg 540 
caaatagata atgcaatggc tttaaatgaa gaatttgatt tattagcatt aaggaaatca 600 
gttatacgtt gggaattgtt gcttactaat aagccaatgg cacttatagg tattattgat 660 
ttattgaaac aggctaaaaa taaaaaactg caatctttaa ctattgcagc tgtgaatggt 720 
ttcttcgaag atatcataca tacaaaggta aatgtagagg ataaacaaat atatagtgat 780
ttaaaaaatg atattgatca atatgcgcaa aagttgtcgt ttaatcaatt aattttgatg 840 
tttgatcaac tgacggaagc acataagaaa ttgaatcaaa atgtaaatcc aacgcttgta 900 
tttgaacaaa tcgtaattaa gggtgtgagt 930

<210> 12
<211> 310 
<212> PRT 

<213> Staphylococcus aureus 
<400> 12 

Met Asp Glu Gln Gln Gln Leu Thr Asn Ala Tyr His Ser Asn Lys Leu
1 5 10 15

Ser His Ala Tyr Leu Phe Glu Gly Asp Asp Ala Gln Thr Met Lys Gln
20 25 30

Val Ala Ile Asn Phe Ala Lys Leu Ile Leu Cys Gln Thr Asp Ser Gln
35 40 45

Cys Glu Thr Lys Val Ser Thr Tyr Asn His Pro Asp Phe Met Tyr Ile
50 55 60

Ser Thr Thr Glu Asn Ala Ile Lys Lys Glu Gln Val Glu Gln Leu Val
65 70 75 80

Arg His Met Asn Gln Leu Pro Ile Glu Ser Thr Asn Lys Val Tyr Ile
85 90 95

Ile Glu Asp Phe Glu Asp Phe Glu Lys Leu Thr Val Gln Gly Glu Asn
100 105 110

Ser Ile Leu Lys Phe Leu Glu Glu Pro Pro Asp Asn Thr Ile Ala Ile
115 120 125

Leu Leu Ser Thr Lys Pro Glu Gln Ile Leu Asp Thr Ile His Ser Arg
130 135 140

Cys Gln His Val Tyr Phe Lys Pro Ile Asp Lys Glu Lys Phe Ile Asn
145 150 155 160

Arg Leu Val Glu Gln Asn Met Ser Lys Pro Val Ala Glu Met Ile Ser
165 170 175

Thr Tyr Thr Thr Gln Ile Asp Asn Ala Met Ala Leu Asn Glu Glu Phe
180 185 190

Asp Leu Leu Ala Leu Arg Lys Ser Val Ile Arg Trp Glu Leu Leu Leu
195 200 205

Thr Asn Lys Pro Met Ala Leu Ile Gly Ile Ile Asp Leu Leu Lys Gln
210 215 220

Ala Lys Asn Lys Lys Leu Gln Ser Leu Thr Ile Ala Ala Val Asn Gly
225 230 235 240

Phe Phe Glu Asp Ile Ile His Thr Lys Val Asn Val Glu Asp Lys Gln
245 250 255

Ile Tyr Ser Asp Leu Lys Asn Asp Ile Asp Gln Tyr Ala Gln Lys Leu
260 265 270

Ser Phe Asn Gln Leu Ile Leu Met Phe Asp Gln Leu Thr Glu Ala His
275 280 285

Lys Lys Leu Asn Gln Asn Val Asn Pro Thr Leu Val Phe Glu Gln Ile
290 295 300

Val Ile Lys Gly Val Ser
305 310



<210> 13 
<211> 744 
<212> DNA 

<213> Staphylococcus aureus 
<400> 13 

atgagcgaca atattgtagc tatttatgga gatgtgcctg aattggttga aaaacaaagt 60 

gcagaaatca tatcacaatt tttgaaaagt gatagagatg actttaactt tgtgaaatat 120

aatttatacg aaacagagat tgcaccaatt gttgaagaaa cattaacatt gcctttcttt 180 

tcagataaaa aagcaatttt ggttaaaaat gcatatatat ttacaggtga aaaagcgcca 240 

aaagatatgg ctcataatgt agaccaatta atagaattta ttgaaaaata tgatggcgaa 300

aatttgattg tctttgagat atatcaaaat aaacttgatg aaagaaaaaa gttaactaaa 360

actctaaaaa agcatgcaag gcttaaaaaa atagagcaga tgtcggagga gatcaagtgg 420 

attcaaaaaa aagaacaagc gattgatttt gtaaaagatc ttataacaat gaaagaagaa 480

ccaattaaac ttcttgcact tacatcaaat tatagacttt tttatcaatg taaaattctt 540 

tcacaaaaag gttatagtgg tcaacaaatt gcaaaaacaa taggtgttca tccatataga 600 

gtgaaacttg cacttggtca agtgagacat tatcaacttg atgaacttct taatattatt 660 

gatgcatgtg cagaaacaga ttataaactt aaatcatcat atatggataa acaacttatt 720 

cttgaacttt ttattctttc actt 744 



<210> 14 
<211> 248 
<212> PRT 

<213> Staphylococcus aureus 
<400> 14 

Met Ser Asp Asn Ile Val Ala Ile Tyr Gly Asp Val Pro Glu Leu Val
1 5 10 15

Glu Lys Gln Ser Ala Glu Ile Ile Ser Gln Phe Leu Lys Ser Asp Arg
20 25 30

Asp Asp Phe Asn Phe Val Lys Tyr Asn Leu Tyr Glu Thr Glu Ile Ala
35 40 45

Pro Ile Val Glu Glu Thr Leu Thr Leu Pro Phe Phe Ser Asp Lys Lys
50 55 60

Ala Ile Leu Val Lys Asn Ala Tyr Ile Phe Thr Gly Glu Lys Ala Pro
65 70 75 80

Lys Asp Met Ala His Asn Val Asp Gln Leu Ile Glu Phe Ile Glu Lys
85 90 95

Tyr Asp Gly Glu Asn Leu Ile Val Phe Glu Ile Tyr Gln Asn Lys Leu
100 105 110

Asp Glu Arg Lys Lys Leu Thr Lys Thr Leu Lys Lys His Ala Arg Leu
115 120 125

Lys Lys Ile Glu Gln Met Ser Glu Glu Ile Lys Trp Ile Gln Lys Lys
130 135 140

Glu Gln Ala Ile Asp Phe Val Lys Asp Leu Ile Thr Met Lys Glu Glu
145 150 155 160

Pro Ile Lys Leu Leu Ala Leu Thr Ser Asn Tyr Arg Leu Phe Tyr Gln
165 170 175

Cys Lys Ile Leu Ser Gln Lys Gly Tyr Ser Gly Gln Gln Ile Ala Lys
180 185 190

Thr Ile Gly Val His Pro Tyr Arg Val Lys Leu Ala Leu Gly Gln Val
195 200 205

Arg His Tyr Gln Leu Asp Glu Leu Leu Asn Ile Ile Asp Ala Cys Ala
210 215 220

Glu Thr Asp Tyr Lys Leu Lys Ser Ser Tyr Met Asp Lys Gln Leu Ile
225 230 235 240

Leu Glu Leu Phe Ile Leu Ser Leu
245

<210> 15
<211> 1719 
<212> DNA 

<213> Staphylococcus aureus 
<400> 15 

atgataggtt tgtgtccttt tcatgatgaa aagacacctt catttacagt ttctgaagat 60 
aaacaaatct gtcattgttt tggttgtaaa aaaggtggca atgtttttca atttactcaa 120
gaaattaaag acatatcatt tgttgaagcg gttaaagaat taggtgatag agttaatgtt 180 
gctgtagata ttgaggcaac acaatctaac tcaaatgttc aaattgcttc tgatgattta 240 
caaatgattg aaatgcatga gttaatacaa gaattttatt attacgcttt aacaaagaca 300 
gtcgaaggcg aacaagcatt aacatactta caagaacgtg gttttacaga tgcgcttatt 360
aaagagcgag gcattggctt tgcacccgat agctcacatt tttgtcatga ttttcttcaa 420 
aaaaagggtt acgatattga attagcatat gaagccggat tattatcacg taacgaagaa 480 
aatttcagtt attacgatag atttcgaaat cgtattatgt ttcctttgaa aaatgcgcaa 540 
ggaagaattg ttggatattc aggtcgaaca tataccggtc aagaaccaaa atacctaaat 600 
agtcctgaaa cgcctatctt tcaaaaaaga aagttgttat ataacttaga taaagcacgt 660 
aaatcaatta gaaaattaga tgaaattgta ttactagaag gttttatgga tgttataaaa 720
tctgatactg ctggcttgaa aaacgttgtt gcaacaatgg gtacacagtt gtcagatgaa 780
catattacct ttatacgaaa gttaacatca aatataacat taatgtttga tggggatttt 840 
gcgggtagtg aagcaacact taaaacaggt caacatttgt tacagcaagg gctaaatgta 900 
tttgttatac aattgccatc tggcatggat ccggatgaat acattggtaa gtatggcaac 960 
gacgcattta ctacttttgt aaaaaatgac aaaaagtcat ttgcacatta taaagtaagt 1020 
atattaaaag atgaaattgc acataatgac ctttcatatg aacgttattt gaaagaactg 1080 
agtcatgaca tttcacttat gaagtcatca attctgcaac aaaaggctat aaatgatgtt 1140 
gcgccatttt tcaatgttag tcctgagcag ttagctaacg aaatacaatt caatcaagca 1200 
ccagccaatt attatccaga agatgagtat ggcggttatg atgagtatgg cggttatatt 1260 
gaacctgagc caattggtat ggcacaattt gacaatttga gccgtcgaga aaaagcggag 1320 
cgagcatttt taaaacattt aatgagagat aaagatacat ttttaaatta ttatgaaagt 1380 
gttgataagg ataacttcac aaatcagcat tttaaatatg tattcgaagt cttacatgat 1440 
ttttatgcgg aaaatgatca atataatatc agtgatgctg tgcagtatgt taattcaaat 1500 
gagttgagag aaacactaat tagcttagaa caatataatt tgaatggcga accatatgaa 1560 
aatgaaattg atgattatgt caatgttatt aatgaaaaag gacaagaaac aattgagtca 1620
ttgaatcata aattaaggga agctacaagg attggcgatg tagaattaca aaaatactat 1680 
ttacagcaaa ttgttgctaa gaataaagaa cgcatgtag 1719 



<210> 16 
<211> 572 
<212> PRT 

<213> Staphylococcus aureus 
<400> 16 

Met Ile Gly Leu Cys Pro Phe His Asp Glu Lys Thr Pro Ser Phe Thr
1 5 10 15

Val Ser Glu Asp Lys Gln Ile Cys His Cys Phe Gly Cys Lys Lys Gly
20 25 30

Gly Asn Val Phe Gln Phe Thr Gln Glu Ile Lys Asp Ile Ser Phe Val
35 40 45

Glu Ala Val Lys Glu Leu Gly Asp Arg Val Asn Val Ala Val Asp Ile
50 55 60

Glu Ala Thr Gln Ser Asn Ser Asn Val Gln Ile Ala Ser Asp Asp Leu
65 70 75 80

Gln Met Ile Glu Met His Glu Leu Ile Gln Glu Phe Tyr Tyr Tyr Ala
85 90 95

Leu Thr Lys Thr Val Glu Gly Glu Gln Ala Leu Thr Tyr Leu Gln Glu
100 105 110

Arg Gly Phe Thr Asp Ala Leu Ile Lys Glu Arg Gly Ile Gly Phe Ala
115 120 125

Pro Asp Ser Ser His Phe Cys His Asp Phe Leu Gln Lys Lys Gly Tyr
130 135 140

Asp Ile Glu Leu Ala Tyr Glu Ala Gly Leu Leu Ser Arg Asn Glu Glu
145 150 155 160

Asn Phe Ser Tyr Tyr Asp Arg Phe Arg Asn Arg Ile Met Phe Pro Leu
165 170 175

Lys Asn Ala Gln Gly Arg Ile Val Gly Tyr Ser Gly Arg Thr Tyr Thr
180 185 190

Gly Gln Glu Pro Lys Tyr Leu Asn Ser Pro Glu Thr Pro Ile Phe Gln
195 200 205

Lys Arg Lys Leu Leu Tyr Asn Leu Asp Lys Ala Arg Lys Ser Ile Arg
210 215 220

Lys Leu Asp Glu Ile Val Leu Leu Glu Gly Phe Met Asp Val Ile Lys
225 230 235 240

Ser Asp Thr Ala Gly Leu Lys Asn Val Val Ala Thr Met Gly Thr Gln
245 250 255

Leu Ser Asp Glu His Ile Thr Phe Ile Arg Lys Leu Thr Ser Asn Ile
260 265 270

Thr Leu Met Phe Asp Gly Asp Phe Ala Gly Ser Glu Ala Thr Leu Lys
275 280 285

Thr Gly Gln His Leu Leu Gln Gln Gly Leu Asn Val Phe Val Ile Gln
290 295 300

Leu Pro Ser Gly Met Asp Pro Asp Glu Tyr Ile Gly Lys Tyr Gly Asn
305 310 315 320

Asp Ala Phe Thr Thr Phe Val Lys Asn Asp Lys Lys Ser Phe Ala His
325 330 335

Tyr Lys Val Ser Ile Leu Lys Asp Glu Ile Ala His Asn Asp Leu Ser
340 345 350

Tyr Glu Arg Tyr Leu Lys Glu Leu Ser His Asp Ile Ser Leu Met Lys
355 360 365

Ser Ser Ile Leu Gln Gln Lys Ala Ile Asn Asp Val Ala Pro Phe Phe
370 375 380

Asn Val Ser Pro Glu Gln Leu Ala Asn Glu Ile Gln Phe Asn Gln Ala
385 390 395 400

Pro Ala Asn Tyr Tyr Pro Glu Asp Glu Tyr Gly Gly Tyr Asp Glu Tyr
405 410 415

Gly Gly Tyr Ile Glu Pro Glu Pro Ile Gly Met Ala Gln Phe Asp Asn
420 425 430

Leu Ser Arg Arg Glu Lys Ala Glu Arg Ala Phe Leu Lys His Leu Met
435 440 445

Arg Asp Lys Asp Thr Phe Leu Asn Tyr Tyr Glu Ser Val Asp Lys Asp
450 455 460

Asn Phe Thr Asn Gln His Phe Lys Tyr Val Phe Glu Val Leu His Asp
465 470 475 480

Phe Tyr Ala Glu Asn Asp Gln Tyr Asn Ile Ser Asp Ala Val Gln Tyr
485 490 495

Val Asn Ser Asn Glu Leu Arg Glu Thr Leu Ile Ser Leu Glu Gln Tyr
500 505 510

Asn Leu Asn Gly Glu Pro Tyr Glu Asn Glu Ile Asp Asp Tyr Val Asn
515 520 525

Val Ile Asn Glu Lys Gly Gln Glu Thr Ile Glu Ser Leu Asn His Lys
530 535 540

Leu Arg Glu Ala Thr Arg Ile Gly Asp Val Glu Leu Gln Lys Tyr Tyr
545 550 555 560

Leu Gln Gln Ile Val Ala Lys Asn Lys Glu Arg Met
565 570



<210> 17 
<211> 4395 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 17 

atgtcagatt tattcgctaa attgatggac cagatagaaa tgccacttga catgagacgt 60

tcaagtgcct tttcatctgc tgatattatc gaggtaaagg tacattcggt gtcacgcttg 120

tgggaatttc attttgcctt tgcagcggtt ttaccgattg caacttatcg tgaattgcat 180

gatcgtttga taagaacttt tgaggcggct gacattaagg taacctttga catccaagct 240

gctcaggtgg attattcaga tgatctgctt caagcttatt accaagaagc ttttgagcat 300

gcaccgtgta atagtgctag ttttaaatct tctttctcaa agctcaaagt gacttatgag 360

gatgacaaac tcattattgc agcgccaggt tttgtgaata acgatcattt tagaaacaat 420

catctgccta atctggtcaa gcaattagaa gcctttggct ttggcatctt gaccatagat 480

atggtgtcag atcaggaaat gactgagcat ttgaccaaga attttgtttc cagtcgtcag 540

gctcttgtga aaaaggctgt gcaggataat ttggaagccc aaaaatctct tgaagccatg 600

atgccaccag ttgaggaagc cacacctgct cctaagtttg actacaagga acgagcagct 660

aagcgtcagg cagggtttga aaaagcaacc atcacaccaa tgattgagat tgagaccgaa 720

gaaaaccgga ttgtctttga gggtatggtt tttgacgtgg agcgtaaaac gactaggaca 780

ggtcgccata tcatcaactt taaaatgaca gactatacct cctcgtttgc tctccaaaaa 840

tgggctaaag acgatgagga gctccgtaaa tttgatatga ttgctaaggg agcttggtta 900

cgggtacaag ggaatattga gaccaatcct tttacgaaga gtctcaccat gaatgtccag 960

caggtcaaag aaattgtccg tcatgagcgc aaagacctga tgccagaagg gcaaaagcgg 1020

gtcgaacttc atgcccacac caatatgtct accatggatg ccttaccgac agtagaaagc 1080

ttgattgata cggcagccaa gtggggacac aaggcgattg ctatcaccga ccatgctaat 1140

gtgcaaagtt ttcctcatgg ctaccatagg gctcgcaaag ctgggattaa ggctattttt 1200

ggcctagaag ccaatattgt tgaggacaag gtgcctattt cttatgaacc tgttgatatg 1260

gatttgcacg aagccaccta tgtggtcttt gacgtggaaa ccacaggtct atctgctatg 1320

aataatgacc tgattcagat tgcggcttcc aaaatgttta aaggaaatat tgtagagcag 1380

tttgatgaat tcattgatcc tgggcatcct ctttcagcct ttaccaccga attgacagga 1440

attaccgata agcatttgca gggcgccaag ccattggtta ctgtcctaaa agcttttcag 1500

gacttttgca aagatagtat cttggttgcc cacaacgcca gttttgacgt gggctttatg 1560

aacgccaatt atgaacgcca cgacttgccc aaaatcacac agcctgtgat tgatacctta 1620

gaatttgcta gaaacttgta tcctgagtac aagcgtcacg gtttgggacc gctcaccaag 1680

cgtttccaag tgagtctaga ccaccatcat atggccaatt acgacgcgga agccacagga 1740

cgtcttttgt ttatttttct aaaagatgcc agagaaaagc atggcatcaa aaatcttttg 1800

caactcaata cagatttggt ggctgaggat tcttacaaaa aagcgcggat taagcatgcg 1860

actatctatg tgcaaaatca ggttggtctt aaaaatatgt ttaagttggt cagcctttcc 1920

aatatcaaat attttgaagg ggtgccgcgt attccaagaa ccgtcttaga tgctcacaga 1980

gagggtttgt tacttggtac agcttgttct gacggcgagg tttttgatgc cgttctgact 2040

aaaggaattg atgcagcggt tgatttggct aggtattatg attttatcga aatcatgcca 2100

ccagccattt accagccatt ggttgtccgt gaattaatca aagatcaagc aggtattgag 2160 

caggtgattc gtgacctcat tgaagtaggg aaacgagcta agaaacctgt gcttgccact 2220 

gggaatgtgc attatctaga gcctgaagaa gagatttacc gtgaaattat tgtgcgtagt 2280

cttggtcagg gtgccatgat taatagaaca atcggccgtg gggaaggggc acagcctgct 2340 

cctctaccta aagcgcactt tagaacaacc aatgaaatgc tggatgagtt tgcctttctt 2400 

ggaaaagacc tcgcttatca agtagttgtg caaaatactc aggattttgc ggaccgtatt 2460 

gaggaagtgg aagtggttaa gggcgatctt tacaccccgt atattgataa ggccgaagag 2520 

acggttgccg aattaaccta tcaaaaagcc tttgaaattt atggtaatcc tctcccagat 2580

attattgatt tacgcattga aaaagagtta acctctatct tggggaacgg ttttgctgtg 2640

atttatctcg cttcccaaat gcttgttaac cggtcaaatg agcgaggcta cctagttggt 2700 

tctaggggat ctgtagggtc tagctttgtc gccaccatga ttgggattac tgaggttaat 2760

cctatgccgc ctcactacgt ttgcccgtcc tgccaacatt ctgaatttat cacagatggg 2820

tcagttggat ctggctatga tttgcctaat aaaccctgtc cgaaatgtgg caccccttat 2880 

caaaaagatg ggcaagacat tccctttgag acctttcttg ggtttgatgg ggataaggtg 2940 

cccgatattg atttgaactt ctctggtgat gaccagccca gtgcccattt ggatgtccga 3000 

gatatttttg gtgacgaata cgcctttcgt gctggaacag ttggtaccgt agcagaaaaa 3060

acagcttatg gatttgtcaa aggctatgaa cgcgactatg gcaagttcta tcgtgatgct 3120

gaggtggatc gtctagcagc aggtgctgct ggtgtgaaac gaacgactgg gcagcaccct 3180 

ggggggattg ttgttattcc taattacatg gatgtttatg attttacccc cgtgcaatat 3240 

ccagccgatg atgtaacggc ttcttggcag acaactcact ttaacttcca tgatattgat 3300 

gaaaacgtct tgaaacttga tatcctaggg catgatgatc cgaccatgat tcgtaaactt 3360 

caggatttat cgggcattga tcctattact attcctgctg atgatccggg agttatggct 3420

ctcttttctg ggacagaggt tttgggcgtt accccggaac aaattgggac accgactggt 3480

atgctaggca ttccagaatt tggaaccaac tttgttcgcg gcatggttaa tgagacgcat 3540

ccgaccactt ttgcggagct tttgcagttg tctggactat ctcatggaac cgatgtttgg 3600

cttggtaatg cacaagattt gattaaagaa ggcattgcaa ccctaaaaac cgttatcggt 3660

tgtcgtgacg acatcatggt ttacctcatg cacgcaggct tagaaccaaa aatggccttt 3720

accattatgg agcgtgtgcg taagggatta tggctaaaaa tttctgagga agaacgtaat 3780

ggctatattg atgccatgcg agaaaacaat gtgcccgact ggtacattga atcgtgtgga 3840

aaaatcaagt acatgttccc taaagcccat gcggcagctt atgttttgat ggcccttcgg 3900

gtggcttatt tcaaggtgca ccaccccatt atgtattatt gtgcttattt ctctattcgt 3960

gcgaaggctt ttgaattaaa aaccatgagt ggtggtttag atgctgttaa agcaagaatg 4020

gaagatatta ctataaaacg taaaaataat gaagccacca atgtggaaaa tgacctcttt 4080

acaaccttgg agattgtcaa cgaaatgtta gaacgcggct ttaagtttgg caaattagac 4140

ctttacaaaa gtgatgctat agaattccaa atcaaaggag atacccttat ccctccattt 4200

atagcgctag aaggtctggg tgaaaacgtg gccaagcaaa tcgttaaagc tcgtcaagaa 4260

ggcgaattcc tctctaaaat ggaattgcgt aaacgaggcg gggcatcgtc aacgctcgtt 4320

gagaaaatgg atgagatggg tattttagga aatatgccag aagataatca attaagtctt 4380

tttgatgact ttttc 4395



<210> 18 
<211> 1465 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 18 
Met Ser Asp Leu Phe Ala Lys Leu Met Asp Gln Ile Glu Met Pro Leu
1 5 10 15

Asp Met Arg Arg Ser Ser Ala Phe Ser Ser Ala Asp Ile Ile Glu Val
20 25 30

Lys Val His Ser Val Ser Arg Leu Trp Glu Phe His Phe Ala Phe Ala
35 40 45

Ala Val Leu Pro Ile Ala Thr Tyr Arg Glu Leu His Asp Arg Leu Ile
50 55 60

Arg Thr Phe Glu Ala Ala Asp Ile Lys Val Thr Phe Asp Ile Gln Ala
65 70 75 80

Ala Gln Val Asp Tyr Ser Asp Asp Leu Leu Gln Ala Tyr Tyr Gln Glu
85 90 95

Ala Phe Glu His Ala Pro Cys Asn Ser Ala Ser Phe Lys Ser Ser Phe
100 105 110

Ser Lys Leu Lys Val Thr Tyr Glu Asp Asp Lys Leu Ile Ile Ala Ala
115 120 125

Pro Gly Phe Val Asn Asn Asp His Phe Arg Asn Asn His Leu Pro Asn
130 135 140

Leu Val Lys Gln Leu Glu Ala Phe Gly Phe Gly Ile Leu Thr Ile Asp
145 150 155 160

Met Val Ser Asp Gln Glu Met Thr Glu His Leu Thr Lys Asn Phe Val
165 170 175

Ser Ser Arg Gln Ala Leu Val Lys Lys Ala Val Gln Asp Asn Leu Glu
180 185 190

Ala Gln Lys Ser Leu Glu Ala Met Met Pro Pro Val Glu Glu Ala Thr
195 200 205

Pro Ala Pro Lys Phe Asp Tyr Lys Glu Arg Ala Ala Lys Arg Gln Ala
210 215 220

Gly Phe Glu Lys Ala Thr Ile Thr Pro Met Ile Glu Ile Glu Thr Glu
225 230 235 240

Glu Asn Arg Ile Val Phe Glu Gly Met Val Phe Asp Val Glu Arg Lys
245 250 255

Thr Thr Arg Thr Gly Arg His Ile Ile Asn Phe Lys Met Thr Asp Tyr
260 265 270

Thr Ser Ser Phe Ala Leu Gln Lys Trp Ala Lys Asp Asp Glu Glu Leu
275 280 285

Arg Lys Phe Asp Met Ile Ala Lys Gly Ala Trp Leu Arg Val Gln Gly
290 295 300

Asn Ile Glu Thr Asn Pro Phe Thr Lys Ser Leu Thr Met Asn Val Gln
305 310 315 320

Gln Val Lys Glu Ile Val Arg His Glu Arg Lys Asp Leu Met Pro Glu
325 330 335

Gly Gln Lys Arg Val Glu Leu His Ala His Thr Asn Met Ser Thr Met
340 345 350

Asp Ala Leu Pro Thr Val Glu Ser Leu Ile Asp Thr Ala Ala Lys Trp
355 360 365

Gly His Lys Ala Ile Ala Ile Thr Asp His Ala Asn Val Gln Ser Phe
370 375 380

Pro His Gly Tyr His Arg Ala Arg Lys Ala Gly Ile Lys Ala Ile Phe
385 390 395 400

Gly Leu Glu Ala Asn Ile Val Glu Asp Lys Val Pro Ile Ser Tyr Glu
405 410 415

Pro Val Asp Met Asp Leu His Glu Ala Thr Tyr Val Val Phe Asp Val
420 425 430

Glu Thr Thr Gly Leu Ser Ala Met Asn Asn Asp Leu Ile Gln Ile Ala
435 440 445

Ala Ser Lys Met Phe Lys Gly Asn Ile Val Glu Gln Phe Asp Glu Phe
450 455 460

Ile Asp Pro Gly His Pro Leu Ser Ala Phe Thr Thr Glu Leu Thr Gly
465 470 475 480

Ile Thr Asp Lys His Leu Gln Gly Ala Lys Pro Leu Val Thr Val Leu
485 490 495

Lys Ala Phe Gln Asp Phe Cys Lys Asp Ser Ile Leu Val Ala His Asn
500 505 510

Ala Ser Phe Asp Val Gly Phe Met Asn Ala Asn Tyr Glu Arg His Asp
515 520 525

Leu Pro Lys Ile Thr Gln Pro Val Ile Asp Thr Leu Glu Phe Ala Arg
530 535 540

Asn Leu Tyr Pro Glu Tyr Lys Arg His Gly Leu Gly Pro Leu Thr Lys
545 550 555 560

Arg Phe Gln Val Ser Leu Asp His His His Met Ala Asn Tyr Asp Ala
565 570 575

Glu Ala Thr Gly Arg Leu Leu Phe Ile Phe Leu Lys Asp Ala Arg Glu
580 585 590

Lys His Gly Ile Lys Asn Leu Leu Gln Leu Asn Thr Asp Leu Val Ala
595 600 605

Glu Asp Ser Tyr Lys Lys Ala Arg Ile Lys His Ala Thr Ile Tyr Val
610 615 620

Gln Asn Gln Val Gly Leu Lys Asn Met Phe Lys Leu Val Ser Leu Ser
625 630 635 640

Asn Ile Lys Tyr Phe Glu Gly Val Pro Arg Ile Pro Arg Thr Val Leu
645 650 655

Asp Ala His Arg Glu Gly Leu Leu Leu Gly Thr Ala Cys Ser Asp Gly
660 665 670

Glu Val Phe Asp Ala Val Leu Thr Lys Gly Ile Asp Ala Ala Val Asp
675 680 685

Leu Ala Arg Tyr Tyr Asp Phe Ile Glu Ile Met Pro Pro Ala Ile Tyr
690 695 700

Gln Pro Leu Val Val Arg Glu Leu Ile Lys Asp Gln Ala Gly Ile Glu
705 710 715 720

Gln Val Ile Arg Asp Leu Ile Glu Val Gly Lys Arg Ala Lys Lys Pro
725 730 735

Val Leu Ala Thr Gly Asn Val His Tyr Leu Glu Pro Glu Glu Glu Ile
740 745 750

Tyr Arg Glu Ile Ile Val Arg Ser Leu Gly Gln Gly Ala Met Ile Asn
755 760 765

Arg Thr Ile Gly Arg Gly Glu Gly Ala Gln Pro Ala Pro Leu Pro Lys
770 775 780

Ala His Phe Arg Thr Thr Asn Glu Met Leu Asp Glu Phe Ala Phe Leu
785 790 795 800

Gly Lys Asp Leu Ala Tyr Gln Val Val Val Gln Asn Thr Gln Asp Phe
805 810 815

Ala Asp Arg Ile Glu Glu Val Glu Val Val Lys Gly Asp Leu Tyr Thr
820 825 830

Pro Tyr Ile Asp Lys Ala Glu Glu Thr Val Ala Glu Leu Thr Tyr Gln
835 840 845

Lys Ala Phe Glu Ile Tyr Gly Asn Pro Leu Pro Asp Ile Ile Asp Leu
850 855 860

Arg Ile Glu Lys Glu Leu Thr Ser Ile Leu Gly Asn Gly Phe Ala Val
865 870 875 880

Ile Tyr Leu Ala Ser Gln Met Leu Val Asn Arg Ser Asn Glu Arg Gly
885 890 895

Tyr Leu Val Gly Ser Arg Gly Ser Val Gly Ser Ser Phe Val Ala Thr
900 905 910

Met Ile Gly Ile Thr Glu Val Asn Pro Met Pro Pro His Tyr Val Cys
915 920 925

Pro Ser Cys Gln His Ser Glu Phe Ile Thr Asp Gly Ser Val Gly Ser
930 935 940

Gly Tyr Asp Leu Pro Asn Lys Pro Cys Pro Lys Cys Gly Thr Pro Tyr
945 950 955 960

Gln Lys Asp Gly Gln Asp Ile Pro Phe Glu Thr Phe Leu Gly Phe Asp
965 970 975

Gly Asp Lys Val Pro Asp Ile Asp Leu Asn Phe Ser Gly Asp Asp Gln
980 985 990

Pro Ser Ala His Leu Asp Val Arg Asp Ile Phe Gly Asp Glu Tyr Ala
995 1000 1005

Phe Arg Ala Gly Thr Val Gly Thr Val Ala Glu Lys Thr Ala Tyr Gly 
1010 1015 1020 
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Phe Val Lys Gly Tyr Glu Arg Asp Tyr Gly Lys Phe Tyr Arg Asp Ala 
1025 1030 1035 1040 

Glu Val Asp Arg Leu Ala Ala Gly Ala Ala Gly Val Lys Arg Thr Thr 
1045 1050 1055 

Gly Gln His Pro Gly Gly Ile Val Val Ile Pro Asn Tyr Met Asp Val 
1060 1065 1070 

Tyr Asp Phe Thr Pro Val Gln Tyr Pro Ala Asp Asp Val Thr Ala Ser 
1075 1080 1085 

Trp Gln Thr Thr His Phe Asn Phe His Asp Ile Asp Glu Asn Val Leu 
1090 1095 1100 

Lys Leu Asp Ile Leu Gly His Asp Asp Pro Thr Met Ile Arg Lys Leu 
1105 1110 1115 1120 

Gln Asp Leu Ser Gly Ile Asp Pro Ile Thr Ile Pro Ala Asp Asp Pro 
1125 1130 1135 

Gly Val Met Ala Leu Phe Ser Gly Thr Glu Val Leu Gly Val Thr Pro 
1140 1145 1150 

Glu Gln Ile Gly Thr Pro Thr Gly Met Leu Gly Ile Pro Glu Phe Gly 
1155 1160 1165 

Thr Asn Phe Val Arg Gly Met Val Asn Glu Thr His Pro Thr Thr Phe 
1170 1175 1180 

Ala Glu Leu Leu Gln Leu Ser Gly Leu Ser His Gly Thr Asp Val Trp 
1185 1190 1195 1200 

Leu Gly Asn Ala Gln Asp Leu Ile Lys Glu Gly Ile Ala Thr Leu Lys 
1205 1210 1215 

Thr Val Ile Gly Cys Arg Asp Asp Ile Met Val Tyr Leu Met His Ala 
1220 1225 1230 

Gly Leu Glu Pro Lys Met Ala Phe Thr Ile Met Glu Arg Val Arg Lys 
1235 1240 1245 

Gly Leu Trp Leu Lys Ile Ser Glu Glu Glu Arg Asn Gly Tyr Ile Asp 
1250 1255 1260 

Ala Met Arg Glu Asn Asn Val Pro Asp Trp Tyr Ile Glu Ser Cys Gly 
1265 1270 1275 1280 

Lys Ile Lys Tyr Met Phe Pro Lys Ala His Ala Ala Ala Tyr Val Leu 
1285 1290 1295 

Met Ala Leu Arg Val Ala Tyr Phe Lys Val His His Pro Ile Met Tyr 
1300 1305 1310 

Tyr Cys Ala Tyr Phe Ser Ile Arg Ala Lys Ala Phe Glu Leu Lys Thr 
1315 1320 1325 

Met Ser Gly Gly Leu Asp Ala Val Lys Ala Arg Met Glu Asp Ile Thr 
1330 1335 1340 

Ile Lys Arg Lys Asn Asn Glu Ala Thr Asn Val Glu Asn Asp Leu Phe 
1345 1350 1355 1360 

Thr Thr Leu Glu Ile Val Asn Glu Met Leu Glu Arg Gly Phe Lys Phe 
1365 1370 1375 

Gly Lys Leu Asp Leu Tyr Lys Ser Asp Ala Ile Glu Phe Gln Ile Lys 
1380 1385 1390 

Gly Asp Thr Leu Ile Pro Pro Phe Ile Ala Leu Glu Gly Leu Gly Glu 
1395 1400 1405 

Asn Val Ala Lys Gln Ile Val Lys Ala Arg Gln Glu Gly Glu Phe Leu 
1410 1415 1420 

Ser Lys Met Glu Leu Arg Lys Arg Gly Gly Ala Ser Ser Thr Leu Val 
1425 1430 1435 1440 

Glu Lys Met Asp Glu Met Gly Ile Leu Gly Asn Met Pro Glu Asp Asn 
1445 1450 1455 

Gln Leu Ser Leu Phe Asp Asp Phe Phe 
1460 1465 



<210> 19 
<211> 3102 
<212> DNA 

<213> Streptococcus pyogenes 

<400> 19 

atgtttgctc aacttgatac taaaactgta tactcattta tggatagttt aattgactta 60 

aatcattatt ttgaacgagc aaagcaattt ggttaccaca ccataggaat catggataag 120 

gataatcttt atggtgctta ccattttatt aaaggttgtc aaaaaaatgg actgcagcca 180 

gttttaggtt tggaaataga gattctctat caagagcggc aggtgctcct taacttaatc 240 

gcccagaata cacaaggcta tcatcagctt ttaaaaattt ccacggcaaa aatgtctggc 300 

aagcttcata tggattactt ctgccaacat ttggaaggga tagcggttat tattcctagt 360 

aagggttgga gcgatacatt agtggtccct tttgactact atatgggtgt tgatcagtat 420 

actgatttat ctcatatgga ttctaagagg cagcttatac ccctaaggac agttcgttat 480 

tttgcgcaag atgatatgga aaccctgcac atgttgcatg ccattcgaga taacctcagt 540 

ctggcagaga cccctgtggt agaaagtgat caagagttag cagattgtca acaactaacc 600 

gccttctatc aaacacactg ccctcaagct ctacagaatt tagaagactt agtgtcagga 660 

atctattatg atttcgatac aaatttaaaa ttgcctcatt ttaatagaga taagtctgcc 720 

aagcaagaat tgcaagactt gactgaggct ggtttgaagg aaaaaggatt gtggaaagag 780 

ccttatcaat cgcgcttact acatgaattg gtcattattt ctgacatggg ctttgatgat 840 

tattttttga ttgtgtggga tttacttcgc tttggacgca gtaaaggcta ttatatggga 900 

atgggacgtg gctcggcggc aggtagtcta gtggcttatg ctctgaacat tacagggatt 960 

gatccagttc aacatgattt gctatttgag cgctttttaa acaaagaacg ttatagcatg 1020 

cctgatattg atatcgatct tccagatatt taccgttcag aatttctacg gtatgtccga 1080 

aatcgttatg gtagcgacca ttcggcgcaa attgtgacct tttcaacctt tggccaggct 1140 

attcgtgatg ttttcaaacg gttcggggtt ccagaatacg aactgactaa tctcactaaa 1200 

aaaattggtt ttaaagatag cttggctact gtctatgaaa agtcaatctc ttttaggcag 1260 

gttattaata gtagaactga atttcaaaag gcttttgcca ttgccaagcg tatcgaagga 1320 

aatccaagac aaacgtccat tcacgcagct ggtattgtga tgagtgatga tgccttgacc 1380 

aatcatattc ctctaaaatc gggcgatgac atgatgatca cccagtatga tgctcatgcg 1440 

gtcgaagcta atggcctgtt aaaaatggat tttttggggt taagaaattt gacctttgtt 1500 

caaaaaatgc aagagaaggt tgctaaagac tacgggtgtc agattgatat tacagccatt 1560 

gatttagaag acccgcaaac gttggcactt tttgctaaag gggataccaa gggaattttc 1620 

caatttgaac aaaatggtgc tattaatctt ttaaaacgga ttaagccaca acgttttgaa 1680 

gaaattgttg ccactaccag tctaaataga ccaggggcaa gtgactatac cactaatttc 1740 

attaaacgaa gagaaggaca agaaaaaatt gatttgattg atcctgtgat tgctcccatt 1800 

ttagagccaa cttacggtat tatgctttat caagaacaag ttatgcagat tgcacaggtt 1860 

tatgctggtt ttacgttagg caaggccgac ttgttaaggc gtgccatgtc taaaaaaaat 1920 

ctacaagaaa tgcaaaaaat ggaagaagac tttattgctt ctgctaagca cctagggaga 1980 

gctgaagaaa cagctagagg actttttaaa cggatggaaa aatttgcagg ttatggtttt 2040 

aaccgcagcc atgcctttgc ctattcagct ttagcttttc aattggctta tttcaaagcc 2100 

cattacccgg ctgtttttta cgatatcatg atgaattatt ctagcagtga ctatatcaca 2160 

gatgctctag aatcagattt tcaagtagcg caagttacca ttaatagtat tccttacact 2220 

gataaaattg aagctagcaa gatttacatg gggctgaaaa atattaaggg gttgccaagg 2280 

gattttgctt attggattat cgagcaaaga ccatttaata gcgtagagga ttttctcact 2340 

agaactccag aaaaatatca aaaaaaggtt ttccttgagc ctctgataaa aataggtctg 2400 

tttgattgct ttgagcctaa ccgtaaaaaa attctggaca atttggatgg tttactggta 2460 

tttgttaatg agcttggttc tcttttttca gattcttcct ttagttgggt agatacgaaa 2520 

gattactcag taactgaaaa atattctttg gaacaggaga tcgttggagt tggcatgagc 2580 

aagcatcctt taattgatat tgctgagaaa agtacccaaa cttttactcc tatttcacag 2640 

ttagtcaaag aaagcgaagc agtcgtactg attcaaatag atagcattag gatcattaga 2700 

accaaaacaa gtgggcagca aatggctttt ttaagtgtga atgacactaa gaaaaagctc 2760 

gatgtcacac tttttccaca agagtatgcc atttataaag accaattaaa agaaggagaa 2820 

ttctattact taaaaggtag aataaaagaa agagaccatc gactgcagat ggtgtgtcag 2880 

caagtgcaaa tggctattag tcaaaaatat tggttattag ttgaaaacca tcagtttgat 2940 

tcccaaattt ctgagatttt aggtgccttt ccaggaacga ctccagttgt tattcactat 3000 

caaaaaaata aggaaacaat tgcattaact aagattcagg ttcatgtaac agagaattta 3060 

aaggaaaaac ttcgtccttt tgttctgaaa acggtttttc ga 3102 



<210> 20 
<211> 1034 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 20 

Met Phe Ala Gln Leu Asp Thr Lys Thr Val Tyr Ser Phe Met Asp Ser 
1 5 10 15 

Leu Ile Asp Leu Asn His Tyr Phe Glu Arg Ala Lys Gln Phe Gly Tyr 
20 25 30 

His Thr Ile Gly Ile Met Asp Lys Asp Asn Leu Tyr Gly Ala Tyr His 
35 40 45 

Phe Ile Lys Gly Cys Gln Lys Asn Gly Leu Gln Pro Val Leu Gly Leu 
50 55 60 

Glu Ile Glu Ile Leu Tyr Gln Glu Arg Gln Val Leu Leu Asn Leu Ile 
65 70 75 80 

Ala Gln Asn Thr Gln Gly Tyr His Gln Leu Leu Lys Ile Ser Thr Ala 
85 90 95 

Lys Met Ser Gly Lys Leu His Met Asp Tyr Phe Cys Gln His Leu Glu 
100 105 110 

Gly Ile Ala Val Ile Ile Pro Ser Lys Gly Trp Ser Asp Thr Leu Val 
115 120 125 

Val Pro Phe Asp Tyr Tyr Met Gly Val Asp Gln Tyr Thr Asp Leu Ser 
130 135 140 

His Met Asp Ser Lys Arg Gln Leu Ile Pro Leu Arg Thr Val Arg Tyr 
145 150 155 160 

Phe Ala Gln Asp Asp Met Glu Thr Leu His Met Leu His Ala Ile Arg 
165 170 175 

Asp Asn Leu Ser Leu Ala Glu Thr Pro Val Val Glu Ser Asp Gln Glu 
180 185 190 

Leu Ala Asp Cys Gln Gln Leu Thr Ala Phe Tyr Gln Thr His Cys Pro 
195 200 205 

Gln Ala Leu Gln Asn Leu Glu Asp Leu Val Ser Gly Ile Tyr Tyr Asp 
210 215 220 

Phe Asp Thr Asn Leu Lys Leu Pro His Phe Asn Arg Asp Lys Ser Ala 
225 230 235 240 

Lys Gln Glu Leu Gln Asp Leu Thr Glu Ala Gly Leu Lys Glu Lys Gly 
245 250 255 

Leu Trp Lys Glu Pro Tyr Gln Ser Arg Leu Leu His Glu Leu Val Ile 
260 265 270 

Ile Ser Asp Met Gly Phe Asp Asp Tyr Phe Leu Ile Val Trp Asp Leu 
275 280 285 

Leu Arg Phe Gly Arg Ser Lys Gly Tyr Tyr Met Gly Met Gly Arg Gly 
290 295 300 

Ser Ala Ala Gly Ser Leu Val Ala Tyr Ala Leu Asn Ile Thr Gly Ile 
305 310 315 320 

Asp Pro Val Gln His Asp Leu Leu Phe Glu Arg Phe Leu Asn Lys Glu 
325 330 335 

Arg Tyr Ser Met Pro Asp Ile Asp Ile Asp Leu Pro Asp Ile Tyr Arg 
340 345 350 

Ser Glu Phe Leu Arg Tyr Val Arg Asn Arg Tyr Gly Ser Asp His Ser 
355 360 365 

Ala Gln Ile Val Thr Phe Ser Thr Phe Gly Pro Lys Gln Ala Ile Arg 
370 375 380 

Asp Val Phe Lys Arg Phe Gly Val Pro Glu Tyr Glu Leu Thr Asn Leu 
385 390 395 400 

Thr Lys Lys Ile Gly Phe Lys Asp Ser Leu Ala Thr Val Tyr Glu Lys 
405 410 415 

Ser Ile Ser Phe Arg Gln Val Ile Asn Ser Arg Thr Glu Phe Gln Lys 
420 425 430 

Ala Phe Ala Ile Ala Lys Arg Ile Glu Gly Asn Pro Arg Gln Thr Ser 
435 440 445 

Ile His Ala Ala Gly Ile Val Met Ser Asp Asp Ala Leu Thr Asn His 
450 455 460 

Ile Pro Leu Lys Ser Gly Asp Asp Met Met Ile Thr Gln Tyr Asp Ala 
465 470 475 480 

His Ala Val Glu Ala Asn Gly Leu Leu Lys Met Asp Phe Leu Gly Leu 
485 490 495 

Arg Asn Leu Thr Phe Val Gln Lys Met Gln Glu Lys Val Ala Lys Asp 
500 505 510 

Tyr Gly Cys Gln Ile Asp Ile Thr Ala Ile Asp Leu Glu Asp Pro Gln 
515 520 525 

Thr Leu Ala Leu Phe Ala Lys Gly Asp Thr Lys Gly Ile Phe Gln Phe 
530 535 540 

Glu Gln Asn Gly Ala Ile Asn Leu Leu Lys Arg Ile Lys Pro Gln Arg 
545 550 555 560 

Phe Glu Glu Ile Val Ala Thr Thr Ser Leu Asn Arg Pro Gly Ala Ser 
565 570 575 

Asp Tyr Thr Thr Asn Phe Ile Lys Arg Arg Glu Gly Gln Glu Lys Ile 
580 585 590 

Asp Leu Ile Asp Pro Val Ile Ala Pro Ile Leu Glu Pro Thr Tyr Gly 
595 600 605 

Ile Met Leu Tyr Gln Glu Gln Val Met Gln Ile Ala Gln Val Tyr Ala 
610 615 620 

Gly Phe Thr Leu Gly Lys Ala Asp Leu Leu Arg Arg Ala Met Ser Lys 
625 630 635 640 

Lys Asn Leu Gln Glu Met Gln Lys Met Glu Glu Asp Phe Ile Ala Ser 
645 650 655 

Ala Lys His Leu Gly Arg Ala Glu Glu Thr Ala Arg Gly Leu Phe Lys 
660 665 670 

Arg Met Glu Lys Phe Ala Gly Tyr Gly Phe Asn Arg Ser His Ala Phe 
675 680 685 

Ala Tyr Ser Ala Leu Ala Phe Gln Leu Ala Tyr Phe Lys Ala His Tyr 
690 695 700 

Pro Ala Val Phe Tyr Asp Ile Met Met Asn Tyr Ser Ser Ser Asp Tyr 
705 710 715 720 

Ile Thr Asp Ala Leu Glu Ser Asp Phe Gln Val Ala Gln Val Thr Ile 
725 730 735 

Asn Ser Ile Pro Tyr Thr Asp Lys Ile Glu Ala Ser Lys Ile Tyr Met 
740 745 750 

Gly Leu Lys Asn Ile Lys Gly Leu Pro Arg Asp Phe Ala Tyr Trp Ile 
755 760 765 

Ile Glu Gln Arg Pro Phe Asn Ser Val Glu Asp Phe Leu Thr Arg Thr 
770 775 780 

Pro Glu Lys Tyr Gln Lys Lys Val Phe Leu Glu Pro Leu Ile Lys Ile 
785 790 795 800 

Gly Leu Phe Asp Cys Phe Glu Pro Asn Arg Lys Lys Ile Leu Asp Asn 
805 810 815 

Leu Asp Gly Leu Leu Val Phe Val Asn Glu Leu Gly Ser Leu Phe Ser 
820 825 830 

Asp Ser Ser Phe Ser Trp Val Asp Thr Lys Asp Tyr Ser Val Thr Glu 
835 840 845 

Lys Tyr Ser Leu Glu Gln Glu Ile Val Gly Val Gly Met Ser Lys His 
850 855 860 

Pro Leu Ile Asp Ile Ala Glu Lys Ser Thr Gln Thr Phe Thr Pro Ile 
865 870 875 880 

Ser Gln Leu Val Lys Glu Ser Glu Ala Val Val Leu Ile Gln Ile Asp 
885 890 895 

Ser Ile Arg Ile Ile Arg Thr Lys Thr Ser Gly Gln Gln Met Ala Phe 
900 905 910 

Leu Ser Val Asn Asp Thr Lys Lys Lys Leu Asp Val Thr Leu Phe Pro 
915 920 925 

Gln Glu Tyr Ala Ile Tyr Lys Asp Gln Leu Lys Glu Gly Glu Phe Tyr 
930 935 940 

Tyr Leu Lys Gly Arg Ile Lys Glu Arg Asp His Arg Leu Gln Met Val 
945 950 955 960 

Cys Gln Gln Val Gln Met Ala Ile Ser Gln Lys Tyr Trp Leu Leu Val 
965 970 975 

Glu Asn His Gln Phe Asp Ser Gln Ile Ser Glu Ile Leu Gly Ala Phe 
980 985 990 

Pro Gly Thr Thr Pro Val Val Ile His Tyr Gln Lys Asn Lys Glu Thr 
995 1000 1005 

Ile Ala Leu Thr Lys Ile Gln Val Thr Glu Asn Leu Lys Glu Lys Leu 
1010 1015 1020 

Arg Pro Phe Val Leu Lys Thr Val Phe Arg 
1025 1030 



<210> 21 
<211> 1038 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 21 

atgattgcga tagaaaagat tgaaaaactg agtaaagaaa atttgggtct tataaccctt 60 

gtcacaggag atgacattgg tcagtatagc cagttgaaat cccgcttaat ggagcagatt 120 

gcttttgata aggatgattt ggcctattct tactttgata tgtctgaggc cgcttatcag 180 

gatgcagaaa tggatctagt gagcctaccc ttctttgctg agcagaaggt ggttattttt 240 

gaccatttgt tagatatcac gaccaataaa aaaagtttct taaaagaaaa agacctaaag 300 

gcctttgaag cctatttaga aaatccctta gagactactc gactaattat ctttgctcca 360 

ggtaaattgg atagtaagag acggcttgtt aagcttttga aacgtgatgc ccttgtttta 420 

gaagccaacc ctctgaaaga agcagagcta agaacttatt ttcaaaaata cagtcatcaa 480 

ctgggtttag gtttcgagag tggtgccttt gaccaattac ttttgaaatc aaacgatgat 540 

tttagtcaaa tcatgaaaaa catggccttt ttaaaagcct ataaaaaaac gggaaatatt 600 

agcctaactg atattgagca agccattcct aaaagtttac aagataatat tttcgatctg 660 

actagacttg tcctaggagg taaaattgat gcggctagag atttgattca tgatttacgg 720 

ttatctggag aagatgacat taaattaatc gctatcatgc taggccaatt tcgcttattt 780 

ttgcagctga ctattcttgc tagagatgta aaaaacgagc aacaactagt gattagttta 840 

tcagatattc ttgggcggcg ggttaatcct taccaggtca agtatgcgtt aaaggattct 900 

aggaccttat ctcttgcctt tctaacagga gcggtgaaaa ccttgattga gacagattac 960 

cagataaaaa caggacttta tgagaagagt tatctagttg atattgctct cttaaaaatc 1020 

atgactcact ctcaaaaa 1038 



<210> 22 
<211> 346 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 22 

Met Ile Ala Ile Glu Lys Ile Glu Lys Leu Ser Lys Glu Asn Leu Gly 
1 5 10 15 

Leu Ile Thr Leu Val Thr Gly Asp Asp Ile Gly Gln Tyr Ser Gln Leu 
20 25 30 

Lys Ser Arg Leu Met Glu Gln Ile Ala Phe Asp Lys Asp Asp Leu Ala 
35 40 45 

Tyr Ser Tyr Phe Asp Met Ser Glu Ala Ala Tyr Gln Asp Ala Glu Met 
50 55 60 

Asp Leu Val Ser Leu Pro Phe Phe Ala Glu Gln Lys Val Val Ile Phe 
65 70 75 80 

Asp His Leu Leu Asp Ile Thr Thr Asn Lys Lys Ser Phe Leu Lys Glu 
85 90 95 

Lys Asp Leu Lys Ala Phe Glu Ala Tyr Leu Glu Asn Pro Leu Glu Thr 
100 105 110 

Thr Arg Leu Ile Ile Phe Ala Pro Gly Lys Leu Asp Ser Lys Arg Arg 
115 120 125 

Leu Val Lys Leu Leu Lys Arg Asp Ala Leu Val Leu Glu Ala Asn Pro 
130 135 140 

Leu Lys Glu Ala Glu Leu Arg Thr Tyr Phe Gln Lys Tyr Ser His Gln 
145 150 155 160 

Leu Gly Leu Gly Phe Glu Ser Gly Ala Phe Asp Gln Leu Leu Leu Lys 
165 170 175 

Ser Asn Asp Asp Phe Ser Gln Ile Met Lys Asn Met Ala Phe Leu Lys 
180 185 190 

Ala Tyr Lys Lys Thr Gly Asn Ile Ser Leu Thr Asp Ile Glu Gln Ala 
195 200 205 

Ile Pro Lys Ser Leu Gln Asp Asn Ile Phe Asp Leu Thr Arg Leu Val 
210 215 220 

Leu Gly Gly Lys Ile Asp Ala Ala Arg Asp Leu Ile His Asp Leu Arg 
225 230 235 240 

Leu Ser Gly Glu Asp Asp Ile Lys Leu Ile Ala Ile Met Leu Gly Gln 
245 250 255 

Phe Arg Leu Phe Leu Gln Leu Thr Ile Leu Ala Arg Asp Val Lys Asn 
260 265 270 

Glu Gln Gln Leu Val Ile Ser Leu Ser Asp Ile Leu Gly Arg Arg Val 
275 280 285 






Asn Pro Tyr Gln Val Lys Tyr Ala Leu Lys Asp Ser Arg Thr Leu Ser 
290 295 300 

Leu Ala Phe Leu Thr Gly Ala Val Lys Thr Leu Ile Glu Thr Asp Tyr 
305 310 315 320 

Gln Ile Lys Thr Gly Leu Tyr Glu Lys Ser Tyr Leu Val Asp Ile Ala 
325 330 335 

Leu Leu Lys Ile Met Thr His Ser Gln Lys 
340 345 



<210> 23 
<211> 873 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 23 

atggatttag cgcaaaaagc tcctaacgtt tatcaagctt ttcagacaat tttaaagaaa 60 

gaccgtctga atcatgctta tcttttttcg ggtgattttg ctaatgaaga aatggctctt 120 

tttttagcta aggtcatctt ttgtgaacag aaaaaggatc agacgccctg cgggcattgt 180 

cgctcttgtc aattgattga acaaggagat tttgccgatg tgacggtatt ggaaccaaca 240 

gggcaagtga ttaaaacgga tgtggtcaaa gaaatgatgg ctaacttttc tcagacagga 300 

tatgaaaaca aacgacaagt ttttattatc aaagattgtg acaaaatgca tatcaatgcc 360 

gctaatagct tgctaaaata cattgaggag cctcagggag aagcttacat atttttattg 420 

accaatgatg ataacaaagt gcttccgacc attaaaagtc ggacacaggt ttttcagttt 480 

cctaaaaacg aagcctatct ttaccaattg gcacaagaaa agggattatt aaaccatcag 540 

gctaagctag tagccaaact tgccacaaac accagtcatc tagaacgtct gttgcaaacg 600 

agcaagcttt tagaactgat aactcaagca gagcgttttg tatctatttg gctgaaagat 660 

cagttgcagg catatttagc gttgaaccgt ctggtacagt tagcaactga aaaagaagaa 720 

caagatttag ttttgaccct tttgaccttg ctcttggcaa gagagcgtgc gcaaacgcct 7 80 

ttgacacaat tggaggctgt ctatcaggct aggctcatgt ggcagagcaa tgttaatttt 840 

caaaacacat tagaatatat ggtgatgtca gaa 873 



<210> 24 

<211> 291 

<212> PRT 

<213> Streptococcus pyogenes 

<400> 24 

Met Asp Leu Ala Gln Lys Ala Pro Asn Val Tyr Gln Ala Phe Gln Thr 
1 5 10 15 

Ile Leu Lys Lys Asp Arg Leu Asn His Ala Tyr Leu Phe Ser Gly Asp 
20 25 30 

Phe Ala Asn Glu Glu Met Ala Leu Phe Leu Ala Lys Val Ile Phe Cys 
35 40 45 

Glu Gln Lys Lys Asp Gln Thr Pro Cys Gly His Cys Arg Ser Cys Gln 
50 55 60 

Leu Ile Glu Gln Gly Asp Phe Ala Asp Val Thr Val Leu Glu Pro Thr 
65 70 75 80 

Gly Gln Val Ile Lys Thr Asp Val Val Lys Glu Met Met Ala Asn Phe 
85 90 95 

Ser Gln Thr Gly Tyr Glu Asn Lys Arg Gln Val Phe Ile Ile Lys Asp 
100 105 110 

Cys Asp Lys Met His Ile Asn Ala Ala Asn Ser Leu Leu Lys Tyr Ile 
115 120 125 

Glu Glu Pro Gln Gly Glu Ala Tyr Ile Phe Leu Leu Thr Asn Asp Asp 
130 135 140 

Asn Lys Val Leu Pro Thr Ile Lys Ser Arg Thr Gln Val Phe Gln Phe 
145 150 155 160 

Pro Lys Asn Glu Ala Tyr Leu Tyr Gln Leu Ala Gln Glu Lys Gly Leu 
165 170 175 

Leu Asn His Gln Ala Lys Leu Val Ala Lys Leu Ala Thr Asn Thr Ser 
180 185 190 

His Leu Glu Arg Leu Leu Gln Thr Ser Lys Leu Leu Glu Leu Ile Thr 
195 200 205 

Gln Ala Glu Arg Phe Val Ser Ile Trp Leu Lys Asp Gln Leu Gln Ala 
210 215 220 

Tyr Leu Ala Leu Asn Arg Leu Val Gln Leu Ala Thr Glu Lys Glu Glu 
225 230 235 240 

Gln Asp Leu Val Leu Thr Leu Leu Thr Leu Leu Leu Ala Arg Glu Arg 
245 250 255 

Ala Gln Thr Pro Leu Thr Gln Leu Glu Ala Val Tyr Gln Ala Arg Leu 
260 265 270 

Met Trp Gln Ser Asn Val Asn Phe Gln Asn Thr Leu Glu Tyr Met Val 
275 280 285 

Met Ser Glu 
290 



<210> 25 
<211> 1665 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 25 

atgtatcaag ctctttatcg gaaataccgg agccaaacgt ttgacgaaat ggtgggacaa 60 

tcggttattt ccacaacttt aaagcaggca gttgaatctg gcaagattag ccatgcttat 120 

cttttttcag gtcctagagg gactgggaaa acaagtgcgg caaagatttt tgcaaaggcc 180 

atgaattgtc ctaaccaagt cgatggtgaa ccctgtaatc aatgcgatat ttgccgagat 240 

atcacgaatg gaagcttgga agatgtgatt gaaattgatg ctgcctcgaa taatggtgtt 300 

gatgaaattc gtgacattcg agacaaatca acctatgcgc caagtcgtgc gacttacaag 360 

gtttatatta ttgatgaggt tcacatgtta tcaacagggg cttttaatgc gcttttgaaa 420 

actttggaag aaccgacaga atgttgtctt tatcttggca acaacggaat gcataaaatt 480 

ccagccacta ttttatctcg tgtgcaacgc tttgaattca aagctattaa gcaaaaagct 540 

attcgagagc atttagcctg ggttttggac aaagaaggta ttgcctatga ggtggatgct 600 

ttaaatctca ttgcaaggcg agcagaagga ggcatgcgtg atgctttatc tattttagat 660 

caggctttga gcttgtcacc agataatcag gtcgccattg caattgccga agaaattaca 720 

ggttctattt ccatacttgc tctgggtgac tatgttcgat atgtctccca agaacaggct 780 

acgcaagctc tggcagcctt agagaccatt tatgatagtg ggaagagcat gagccgcttt 840 

gcgacagatt tattgaccta tctgcgtgat ttattggtgg ttaaagctgg cggcgacaat 900 

caacgtcagt cagctgtttt tgataccaat ttgtctctct cgatagatcg tatattccaa 960 

atgataacag ttgttactag tcatctccct gaaatcaaaa agggaaccca tcctcggatt 1020 

tatgccgaaa tgatgactat ccaattagct cagaaagagc agattttgtc ccaagtaaac 1080 

ttgtcaggag agttaatctc agagattgaa acgctcaaaa atgagttggc acaacttaaa 1140 

caacaattgt cgcagctcca atcgcgtcct gattcactgg caagatctga taaaacgaaa 1200 

cctaaaacca caagctacag ggttgatcgg gttaccattt tgaaaatcat ggaagaaacg 1260 

gttcgaaata gccaacaatc tcgacaatat ctagatgctc taaaaaatgc ttggaatgaa 1320 

attctagata acatttctgc ccaagacaga gccttattga tgggctctga gcctgtctta 1380 

gcaaatagtg agaatgcgat tttggctttc gaggctgcct ttaatgcaga acaagtcatg 1440 

agccgaaata atcttaatga tatgtttggt aacattatga gtaaagctgc tggtttttct 1500 

cccaatattc tggcagtacc aaggacagat tttcagcata ttcgtaagga atttgctcag 1560 

caaatgaaat cgcaaaaaga cagtgttcaa gaagaacaag aagtagcgct tgatattcca 1620 

gaagggtttg attttttgct cgataaaata aatactattg acgac 1665 



<210> 26 
<211> 555 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 26 

Met Tyr Gln Ala Leu Tyr Arg Lys Tyr Arg Ser Gln Thr Phe Asp Glu 
1 5 10 15 

Met Val Gly Gln Ser Val Ile Ser Thr Thr Leu Lys Gln Ala Val Glu 
20 25 30 

Ser Gly Lys Ile Ser His Ala Tyr Leu Phe Ser Gly Pro Arg Gly Thr 
35 40 45 

Gly Lys Thr Ser Ala Ala Lys Ile Phe Ala Lys Ala Met Asn Cys Pro 
50 55 60 

Asn Gln Val Asp Gly Glu Pro Cys Asn Gln Cys Asp Ile Cys Arg Asp 
65 70 75 80 

Ile Thr Asn Gly Ser Leu Glu Asp Val Ile Glu Ile Asp Ala Ala Ser 
85 90 95 

Asn Asn Gly Val Asp Glu Ile Arg Asp Ile Arg Asp Lys Ser Thr Tyr 
100 105 110 

Ala Pro Ser Arg Ala Thr Tyr Lys Val Tyr Ile Ile Asp Glu Val His 
115 120 125 

Met Leu Ser Thr Gly Ala Phe Asn Ala Leu Leu Lys Thr Leu Glu Glu 
130 135 140 

Pro Thr Glu Asn Val Phe Ile Leu Ala Thr Thr Glu Leu His Lys Ile 
145 150 155 160 

Pro Ala Thr Ile Leu Ser Arg Val Gln Arg Phe Glu Phe Lys Ala Ile 
165 170 175 

Lys Gln Lys Ala Ile Arg Glu His Leu Ala Trp Val Leu Asp Lys Glu 
180 185 190 

Gly Ile Ala Tyr Glu Val Asp Ala Leu Asn Leu Ile Ala Arg Arg Ala 
195 200 205 

Glu Gly Gly Met Arg Asp Ala Leu Ser Ile Leu Asp Gln Ala Leu Ser 
210 215 220 

Leu Ser Pro Asp Asn Gln Val Ala Ile Ala Ile Ala Glu Glu Ile Thr 
225 230 235 240 

Gly Ser Ile Ser Ile Leu Ala Leu Gly Asp Tyr Val Arg Tyr Val Ser 
245 250 255 

Gln Glu Gln Ala Thr Gln Ala Leu Ala Ala Leu Glu Thr Ile Tyr Asp 
260 265 270 

Ser Gly Lys Ser Met Ser Arg Phe Ala Thr Asp Leu Leu Thr Tyr Leu 
275 280 285 

Arg Asp Leu Leu Val Val Lys Ala Gly Gly Asp Asn Gln Arg Gln Ser 
290 295 300 

Ala Val Phe Asp Thr Asn Leu Ser Leu Ser Ile Asp Arg Ile Phe Gln 
305 310 315 320 

Met Ile Thr Val Val Thr Ser His Leu Pro Glu Ile Lys Lys Gly Thr 
325 330 335 

His Pro Arg Ile Tyr Ala Glu Met Met Thr Ile Gln Leu Ala Gln Lys 
340 345 350 

Glu Gln Ile Leu Ser Gln Val Asn Leu Ser Gly Glu Leu Ile Ser Glu 
355 360 365 

Ile Glu Thr Leu Lys Asn Glu Leu Ala Gln Leu Lys Gln Gln Leu Ser 
370 375 380 

Gln Leu Gln Ser Arg Pro Asp Ser Leu Ala Arg Ser Asp Lys Thr Lys 
385 390 395 400 

Pro Lys Thr Thr Ser Tyr Arg Val Asp Arg Val Thr Ile Leu Lys Ile 
405 410 415 

Met Glu Glu Thr Val Arg Asn Ser Gln Gln Ser Arg Gln Tyr Leu Asp 
420 425 430 

Ala Leu Lys Asn Ala Trp Asn Glu Ile Leu Asp Asn Ile Ser Ala Gln 
435 440 445 

Asp Arg Ala Leu Leu Met Gly Ser Glu Pro Val Leu Ala Asn Ser Glu 
450 455 460 

Asn Ala Ile Leu Ala Phe Glu Ala Ala Phe Asn Ala Glu Gln Val Met 
465 470 475 480 

Ser Arg Asn Asn Leu Asn Asp Met Phe Gly Asn Ile Met Ser Lys Ala 
485 490 495 

Ala Gly Phe Ser Pro Asn Ile Leu Ala Val Pro Arg Thr Asp Phe Gln 
500 505 510 

His Ile Arg Lys Glu Phe Ala Gln Gln Met Lys Ser Gln Lys Asp Ser 
515 520 525 

Val Gln Glu Glu Gln Glu Val Ala Leu Asp Ile Pro Glu Gly Phe Asp 
530 535 540 

Phe Leu Leu Asp Lys Ile Asn Thr Ile Asp Asp 
545 550 555 



<210> 27 
<211> 1134 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 27 

atgattcaat tttcaattaa tcgcacatta tttattcatg ctttaaatac aactaaacgt 60 

gctattagca ctaaaaatgc cattcctatt ctttcatcaa taaaaattga agtcacttct 120 

acaggagtaa ctttaacagg gtctaacggt caaatatcaa ttgaaaacac tattcctgta 180 

agtaatgaaa atgctggttt gctaattacc tctccaggag ctattttatt agaagctagt 240 

ttttttatta atattatttc aagtttgcca gatattagta taaatgttaa agaaattgaa 300 

caacaccaag ttgttttaac cagtggtaaa tcagagatta ccttaaaagg aaaagatgtt 360 

gaccagtatc ctcgtctaca agaagtatca acagaaaatc ctttgatttt aaaaacaaaa 420 

ttattgaagt ctattattgc tgaaacagct tttgcagcca gtttacaaga aagtcgtcct 480 

attttaacag gagttcatat tgtattaagt aatcataaag attttaaagc agtagcgact 540 

gactctcatc gtatgagcca acgtttaatc actttggaca atacttcagc agatttgatg 600 

gtagttcttc caagtaaatc tttgagagaa ttttcagcag tatttacaga tgatattgag 660 

accgttgagg tatttttctc accaagccaa atcttgttca gaagtgaaca catttctttt 720 

tatacacgcc tcttagaagg aaattatccc gatacagacc gtttattaat gacagaattt 780 

gagacggagg ttgttttcaa tacccaatcc cttcgccacg ctatggaacg tgccttcttg 840 

atttctaatg ctactcaaaa tggtactgtt aagcttgaga ttactcaaaa tcatatttca 900 

gctcatgtta actcacctga ggttggtaag gtaaacgagg atttagatat tgttagtcag 960 

tctggtagtg atttaactat cagcttcaat ccaacttacc ttattgagtc tttaaaagct 1020 

attaaaagtg aaacagtaaa aattcatttc ttatcaccag ttcgaccatt caccctaaca 1080 

ccaggcgatg aggaagaaag ttttatccaa ttaattacac cagtacgaac aaac 1134 



<210> 28 
<211> 378 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 28 

Met Ile Gln Phe Ser Ile Asn Arg Thr Leu Phe Ile His Ala Leu Asn 
1 5 10 15 

Thr Thr Lys Arg Ala Ile Ser Thr Lys Asn Ala Ile Pro Ile Leu Ser 
20 25 30 

Ser Ile Lys Ile Glu Val Thr Ser Thr Gly Val Thr Leu Thr Gly Ser 
35 40 45 

Asn Gly Gln Ile Ser Ile Glu Asn Thr Ile Pro Val Ser Asn Glu Asn 
50 55 60 

Ala Gly Leu Leu Ile Thr Ser Pro Gly Ala Ile Leu Leu Glu Ala Ser 
65 70 75 80 

Phe Phe Ile Asn Ile Ile Ser Ser Leu Pro Asp Ile Ser Ile Asn Val 
85 90 95 

Lys Glu Ile Glu Gln His Gln Val Val Leu Thr Ser Gly Lys Ser Glu 
100 105 110 

Ile Thr Leu Lys Gly Lys Asp Val Asp Gln Tyr Pro Arg Leu Gln Glu 
115 120 125 

Val Ser Thr Glu Asn Pro Leu Ile Leu Lys Thr Lys Leu Leu Lys Ser 
130 135 140 

Ile Ile Ala Glu Thr Ala Phe Ala Ala Ser Leu Gln Glu Ser Arg Pro 
145 150 155 160 

Ile Leu Thr Gly Val His Ile Val Leu Ser Asn His Lys Asp Phe Lys 
165 170 175 

Ala Val Ala Thr Asp Ser His Arg Met Ser Gln Arg Leu Ile Thr Leu 
180 185 190 

Asp Asn Thr Ser Ala Asp Leu Met Val Val Leu Pro Ser Lys Ser Leu 
195 200 205 

Arg Glu Phe Ser Ala Val Phe Thr Asp Asp Ile Glu Thr Val Glu Val 
210 215 220 

Phe Phe Ser Pro Ser Gln Ile Leu Phe Arg Ser Glu His Ile Ser Phe 
225 230 235 240 

Tyr Thr Arg Leu Leu Glu Gly Asn Tyr Pro Asp Thr Asp Arg Leu Leu 
245 250 255 

Met Thr Glu Phe Glu Thr Glu Val Val Phe Asn Thr Gln Ser Leu Arg 
260 265 270 

His Ala Met Glu Arg Ala Phe Leu Ile Ser Asn Ala Thr Gln Asn Gly 
275 280 285 

Thr Val Lys Leu Glu Ile Thr Gln Asn His Ile Ser Ala His Val Asn 
290 295 300 

Ser Pro Glu Val Gly Lys Val Asn Glu Asp Leu Asp Ile Val Ser Gln 
305 310 315 320 

Ser Gly Ser Asp Leu Thr Ile Ser Phe Asn Pro Thr Tyr Leu Ile Glu 
325 330 335 

Ser Leu Lys Ala Ile Lys Ser Glu Thr Val Lys Ile His Phe Leu Ser 
340 345 350 

Pro Val Arg Pro Phe Thr Leu Thr Pro Gly Asp Glu Glu Glu Ser Phe 
355 360 365 

Ile Gln Leu Ile Thr Pro Val Arg Thr Asn 
370 375 



<210> 29 
<211> 492 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 29 

atgattaata atgtagtact agttggtcgc atgaccaagg atgcagaact tcgttacaca 60 

ccaagtcaag tagctgtggc taccttcaca cttgctgtta accgtacctt taaaagccaa 120 

aatggtgaac gcgaggcaga tttcattaac tgtgtgatct ggcgtcaacc ggctgaaaat 180 

ttagcgaact gggctaaaaa aggtgctttg atcggagtta cgggtcgtat tcatacacgt 240 

aactacgaaa accaacaagg acaacgtgtc tatgtaacag aagttgttgc agataatttc 300 

caaatgttgg aaagtcgtgc tacacgtgaa ggtggctcaa ctggctcatt taatggtggt 360 

tttaacaata acacttcatc atcaaacagt tactcagcgc ctgcacaaca aacgcctaac 420 

tttggaagag atgatagccc atttgggaac tcaaacccga tggatatctc agatgacgat 480 

cttccattct ag 492 



<210> 30 
<211> 163 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 30 

Met Ile Asn Asn Val Val Leu Val Gly Arg Met Thr Lys Asp Ala Glu 
1 5 10 15 

Leu Arg Tyr Thr Pro Ser Gln Val Ala Val Ala Thr Phe Thr Leu Ala 
20 25 30 

Val Asn Arg Thr Phe Lys Ser Gln Asn Gly Glu Arg Glu Ala Asp Phe 
35 40 45 

Ile Asn Cys Val Ile Trp Arg Gln Pro Ala Glu Asn Leu Ala Asn Trp 
50 55 60 

Ala Lys Lys Gly Ala Leu Ile Gly Val Thr Gly Arg Ile Gln Thr Arg 
65 70 75 80 

Asn Tyr Glu Asn Gln Gln Gly Gln Arg Val Tyr Val Thr Glu Val Val 
85 90 95 

Ala Asp Asn Phe Gln Met Leu Glu Ser Arg Ala Thr Arg Glu Gly Gly 
100 105 110 

Ser Thr Gly Ser Phe Asn Gly Gly Phe Asn Asn Asn Thr Ser Ser Ser 
115 120 125 

Asn Ser Tyr Ser Ala Pro Ala Gln Gln Thr Pro Asn Phe Gly Arg Asp 
130 135 140 

Asp Ser Pro Phe Gly Asn Ser Asn Pro Met Asp Ile Ser Asp Asp Asp 
145 150 155 160 

Leu Pro Phe 



<210> 31 
<211> 1815 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 31 

atgggatttt tatggggagg tgacgatttg gcaattgaca aagaaatgat ttcccaagta 60 

aaaaatagcg ttaatattgt cgatgtcatt ggagaagtgg tcaaactttc ccgatcaggg 120 

cggcattacc tcgggctttg cccatttcat aaggaaaaga caccctcttt taatgttgtt 180 

gaagacagac aattttttca ctgctttggc tgtggaaaat caggggatgt ttttaaattt 240 

attgaggaat accgccaagt ccccttctta gaaagtgttc agattattgc cgataagact 300 

ggtatgtcgc ttaatatacc gccaagtcag gcagtacttg ctagccaaca caagcaccct 360 

aatcacgctt tgatgacact tcatgaggat gctgctaaat tttaccatgc agttttgatg 420 

accactacca ttggtcaaga agctaggaag tacctttacc agagaggctt ggatgaccaa 480 

ttaattgagc atttcaatat tggtttagcc ccagatgagt cagattatct ttatcaagct 540 

ctttctaaaa aatacgagga aggtcaattg gttgcttcag gattgtttca cttgtccgat 600 

caatccaata ccatttacga cgcctttcga aatcgtatca tgtttccctt atcagatgac 660 

cgagggcata ttattgcctt ttcaggacgt atctggacgg cagctgatat ggaaaagaga 720 

caggcaaagt ataaaaattc aagaggaaca gttcttttta acaaatctta tgaattgtat 780 

catctggaca aggcaaggcc tgttattgcc aaaacccatg aagtgtttct aatggaaggg 840 

tttatggacg tgattgccgc ttaccgttcc ggctatgaaa atgctgttgc ttcaatgggg 900 

acggcattga ctcaagaaca tgtcaatcac cttaagcaag tcactaaaaa agttgttttg 960 

atttatgatg gtgacgatgc tggacaacat gctattgcaa aatcactaga attgcttaaa 1020 

gattttgttg tcgaaattgt cagaatcccc aataaaatgg atcctgacga atttgtacaa 1080 

cggcattccc cagaagcatt tgcagatttg cttaagcagt cacggatcag tagtgttgaa 1140 

ttttttattg attacctaaa acctactaat gtagacaatt tgcaatcaca aattgtttat 1200 

gtggagaaaa tggcaccatt gattgctcaa tcaccatcca tcacagctca acattcgtat 1260 

attaacaaga ttgctgattt gttgccaaac tttgactatt ttcaagtaga acaatcagta 1320 

aatgcattaa ggattcaaga taggcaaaaa catcaaggtc aaatagctca agccgtcagc 1380 

aatcttgtga ccttaccaat gccaaaaagt ttgacagcta ttgctaagac agaaagtcat 1440 

ctcatgcatc ggctcttaca tcatgactat ttattaaatg aatttcgaca tcgtgatgat 1500 

ttttattttg atacctctac cttagaatta ctttatcaac ggctgaagca acaaggacac 1560 

attacatctt atgatttgtc agagatgtca gaggaagtta accgtgctta ttacaatgtt 1620 

ttagaagaaa accttcccaa agaagtagct cttggtgaga ttgatgatat tttatccaaa 1680 

cgtgccaaac ttttagcaga gcgcgatctt cacaaacaag ggaaaaaagt tagagaatct 1740 

agtaacaaag gcgatcatca agcggctcta gaagtactag aacattttat tgcgcagaaa 1800 

cgaaaaatgg aatag 1815 



<210> 32 
<211> 600 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 32 

Met Gly Phe Leu Trp Gly Gly Asp Asp Leu Ala He Asp Lys Glu Met 
15 10 15 

He Ser Gin Val Lys Asn Ser Val Asn lie Val Asp Val He Gly Glu 
20 25 30 

Val Val Lys Leu Ser Arg Ser Gly Arg His Tyr Leu Gly Leu Cys Pro 
35 40 45 

Phe His Lys Glu Lys Thr Pro Ser Phe Asn Val Val Glu Asp Arg Gin 
50 55 60 

Phe Phe His Cys Phe Gly Cys Gly Lys Ser Gly Asp Val Phe Lys Phe 
65 70 75 80 

lie Glu Glu Tyr Arg Gin Val Pro Phe Leu Glu Ser Val Gin lie lie 
85 90 95 

Ala Asp Lys Thr Gly Met Ser Leu Asn lie Pro Pro Ser Gin Ala Val 
100 105 110 

Leu Ala Ser Gin His Lys His Pro Asn His Ala Leu Met Thr Leu His 
115 120 125 

Glu Asp Ala Ala Lys Phe Tyr His Ala Val Leu Met Thr Thr Thr Ile
130 135 140

Gly Gln Glu Ala Arg Lys Tyr Leu Tyr Gln Arg Gly Leu Asp Asp Gln
145 150 155 160

Leu Ile Glu His Phe Asn Ile Gly Leu Ala Pro Asp Glu Ser Asp Tyr
165 170 175

Leu Tyr Gln Ala Leu Ser Lys Lys Tyr Glu Glu Gly Gln Leu Val Ala
180 185 190

Ser Gly Leu Phe His Leu Ser Asp Gln Ser Asn Thr Ile Tyr Asp Ala
195 200 205

Phe Arg Asn Arg Ile Met Phe Pro Leu Ser Asp Asp Arg Gly His Ile
210 215 220

Ile Ala Phe Ser Gly Arg Ile Trp Thr Ala Ala Asp Met Glu Lys Arg
225 230 235 240

Gln Ala Lys Tyr Lys Asn Ser Arg Gly Thr Val Leu Phe Asn Lys Ser
245 250 255

Tyr Glu Leu Tyr His Leu Asp Lys Ala Arg Pro Val Ile Ala Lys Thr
260 265 270

His Glu Val Phe Leu Met Glu Gly Phe Met Asp Val Ile Ala Ala Tyr
275 280 285

Arg Ser Gly Tyr Glu Asn Ala Val Ala Ser Met Gly Thr Ala Leu Thr
290 295 300

Gln Glu His Val Asn His Leu Lys Gln Val Thr Lys Lys Val Val Leu
305 310 315 320

Ile Tyr Asp Gly Asp Asp Ala Gly Gln His Ala Ile Ala Lys Ser Leu
325 330 335

Glu Leu Leu Lys Asp Phe Val Val Glu Ile Val Arg Ile Pro Asn Lys
340 345 350

Met Asp Pro Asp Glu Phe Val Gln Arg His Ser Pro Glu Ala Phe Ala
355 360 365

Asp Leu Leu Lys Gln Ser Arg Ile Ser Ser Val Glu Phe Phe Ile Asp
370 375 380

Tyr Leu Lys Pro Thr Asn Val Asp Asn Leu Gln Ser Gln Ile Val Tyr
385 390 395 400

Val Glu Lys Met Ala Pro Leu Ile Ala Gln Ser Pro Ser Ile Thr Ala
405 410 415

Gln His Ser Tyr Ile Asn Lys Ile Ala Asp Leu Leu Pro Asn Phe Asp
420 425 430

Tyr Phe Gln Val Glu Gln Ser Val Asn Ala Leu Arg Ile Gln Asp Arg
435 440 445

Gln Lys His Gln Gly Gln Ile Ala Gln Ala Val Ser Asn Leu Val Thr
450 455 460

Leu Pro Met Pro Lys Ser Leu Thr Ala Ile Ala Lys Thr Glu Ser His
465 470 475 480

Leu Met His Arg Leu Leu His His Asp Tyr Leu Leu Asn Glu Phe Arg
485 490 495

His Arg Asp Asp Phe Tyr Phe Asp Thr Ser Thr Leu Glu Leu Leu Tyr
500 505 510

Gln Arg Leu Lys Gln Gln Gly His Ile Thr Ser Tyr Asp Leu Ser Glu
515 520 525

Met Ser Glu Glu Val Asn Arg Ala Tyr Tyr Asn Val Leu Glu Glu Asn
530 535 540

Leu Pro Lys Glu Val Ala Leu Gly Glu Ile Asp Asp Ile Leu Ser Lys
545 550 555 560

Arg Ala Lys Leu Leu Ala Glu Arg Asp Leu His Lys Gln Gly Lys Lys
565 570 575

Val Arg Glu Ser Ser Asn Lys Gly Asp His Gln Ala Ala Leu Glu Val
580 585 590

Leu Glu His Phe Ile Ala Gln Lys
595 600 



<210> 33 
<211> 1368 
<212> DNA 

<213> Streptococcus pyogenes 
<400> 33

atgaggttgc ctgaagtagc tgaattacga gttcaacccc aagatttact agcagagcaa 60
tctgttcttg ggtcaatctt tatctcacct gataagctga ttgcagtgag agaatttatc 120
agtccagacg atttttataa gtacgctcat aaaattatct ttcgggcaat gattaccctc 180
agcgatcgta atgatgccat tgatgcaacc actataagaa caatcctaga tgatcaagat 240
gatctgcaaa gtattggtgg cttatcctat attgttgaac tagttaatag tgtcccaact 300
agtgctaatg cagaatatta tgctaaaatt gtagctgaga aagctatgtt gcgtgatatt 360
attgctaggt tgacagaatc tgtcaaccta gcttatgatg aaattttaaa accagaagag 420
gttatcgctg gagttgagag agctttaatt gaactcaatg aacatagtaa tcgtagtggg 480
tttcgcaaaa tttcagatgt gctaaaagtt aattacgagg ctttagaagc acgttctaag 540
cagacttcaa atgttacagg tttaccaact ggttttagag accttgacaa gattacaaca 600
ggtttacacc cagatcaatt agttatttta gctgctcggc cagcagtggg gaagactgcc 660
tttgttctta atattgcgca aaatgtgggg actaagcaaa aaaagactgt tgctattttt 720
tctttggaaa tgggtgctga aagtttagta gatcgtatgc ttgcagcaga aggaatggtt 780
gattcgcaca gtttaagaac agggcaactc acagatcagg attggaataa tgtaacaatt 840
gctcagggag ctttggcaga agcaccgatt tatattgacg atacgcccgg gattaaaatt 900
actgaaatcc gcgcaagatc acggaaattg tctcaagaag tggatggtgg tttaggtctc 960
attgtaattg actacttaca gttgattaca ggaactaaac ccgaaaatcg tcagcaagag 1020
gtttcagata tttcaagaca gcttaaaatc ctagctaaag aattgaaagt accagttatt 1080
gccctaagtc agctttctcg tggcgttgag caaaggcaag ataaacgacc agttttatca 1140
gatattcgtg aatcaggatc tattgagcag gatgccgata ttgtagcctt cttataccgg 1200
gacgattatt accgtaaaga atgtgatgat gctgaagaag ctgttgaaga taacacaatt 1260
gaagttatcc tcgagaaaaa tagagctggg gcgcgtggaa cagtcaaact gatgttccaa 1320
aaagaataca acaaattctc aagtatagcc cagtttgaag aaagataa 1368



<210> 34 
<211> 455 
<212> PRT 

<213> Streptococcus pyogenes 
<400> 34 

Met Arg Leu Pro Glu Val Ala Glu Leu Arg Val Gln Pro Gln Asp Leu
1 5 10 15

Leu Ala Glu Gln Ser Val Leu Gly Ser Ile Phe Ile Ser Pro Asp Lys
20 25 30

Leu Ile Ala Val Arg Glu Phe Ile Ser Pro Asp Asp Phe Tyr Lys Tyr
35 40 45

Ala His Lys Ile Ile Phe Arg Ala Met Ile Thr Leu Ser Asp Arg Asn
50 55 60

Asp Ala Ile Asp Ala Thr Thr Ile Arg Thr Ile Leu Asp Asp Gln Asp
65 70 75 80 

Asp Leu Gln Ser Ile Gly Gly Leu Ser Tyr Ile Val Glu Leu Val Asn
85 90 95

Ser Val Pro Thr Ser Ala Asn Ala Glu Tyr Tyr Ala Lys Ile Val Ala
100 105 110

Glu Lys Ala Met Leu Arg Asp Ile Ile Ala Arg Leu Thr Glu Ser Val
115 120 125

Asn Leu Ala Tyr Asp Glu Ile Leu Lys Pro Glu Glu Val Ile Ala Gly
130 135 140

Val Glu Arg Ala Gln Gly Ala Leu Ala Glu Ala Pro Ile Tyr Ile Asp
145 150 155 160

Asp Thr Pro Gly Ile Lys Ile Ala Leu Ile Glu Leu Asn Glu His Ser
165 170 175

Asn Arg Ser Gly Phe Arg Lys Ile Ser Asp Val Leu Lys Val Asn Tyr
180 185 190

Glu Ala Leu Glu Ala Arg Ser Lys Gln Thr Ser Asn Val Thr Gly Leu
195 200 205

Pro Thr Gly Phe Arg Asp Leu Asp Lys Ile Thr Thr Gly Leu His Pro
210 215 220

Asp Gln Leu Val Ile Leu Ala Ala Arg Pro Ala Val Gly Lys Thr Ala
225 230 235 240

Phe Val Leu Asn Ile Ala Gln Asn Val Gly Thr Lys Gln Lys Lys Thr
245 250 255

Val Ala Ile Phe Ser Leu Glu Met Gly Ala Glu Ser Leu Val Asp Arg
260 265 270

Met Leu Ala Ala Glu Gly Met Val Asp Ser His Ser Leu Arg Thr Gly
275 280 285

Gln Leu Thr Asp Gln Asp Trp Asn Asn Val Thr Ile Thr Glu Ile Arg
290 295 300

Ala Arg Ser Arg Lys Leu Ser Gln Glu Val Asp Gly Gly Leu Gly Leu
305 310 315 320

Ile Val Ile Asp Tyr Leu Gln Leu Ile Thr Gly Thr Lys Pro Glu Asn
325 330 335 

Arg Gln Gln Glu Val Ser Asp Ile Ser Arg Gln Leu Lys Ile Leu Ala
340 345 350

Lys Glu Leu Lys Val Pro Val Ile Ala Leu Ser Gln Leu Ser Arg Gly
355 360 365

Val Glu Gln Arg Gln Asp Lys Arg Pro Val Leu Ser Asp Ile Arg Glu
370 375 380

Ser Gly Ser Ile Glu Gln Asp Ala Asp Ile Val Ala Phe Leu Tyr Arg
385 390 395 400

Asp Asp Tyr Tyr Arg Lys Glu Cys Asp Asp Ala Glu Glu Ala Val Glu
405 410 415

Asp Asn Thr Ile Glu Val Ile Leu Glu Lys Asn Arg Ala Gly Ala Arg
420 425 430

Gly Thr Val Lys Leu Met Phe Gln Lys Glu Tyr Asn Lys Phe Ser Ser
435 440 445

Ile Ala Gln Phe Glu Glu Arg
450 455



<210> 35 
<211> 29 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: primer 
<400> 35 

ggtggtaatt gtcttgcata tgacagagc 29



<210> 36
<211> 31
<212> DNA

<213> Artificial Sequence



<220> 

<223> Description of Artificial Sequence: primer 
<400> 36 

agcgattaag tggattgccg ggttgtgatg c 31 

<210> 37
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 37 

agcatcacaa cccggcaatc cacttaatcg c 31 

<210> 38
<211> 29
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 38

gactacgcca tgggcattaa ataaatacc 29

<210> 39
<211> 30
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 39

gaagatgcat ataaacgtgc aagacctagt 30

<210> 40
<211> 34
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 40

gtctgacgca cgaattgtaa agtaagatgc atag 34

<210> 41
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 41 

cgactggaag gagttttaac atatgatgga attcac 36



<210> 42 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 42 

ttatatggat ccttagtaag ttctgattgg 30



<210> 43 
<211> 15 
<212> PRT 

<213> Escherichia coli 
<400> 43 

Leu Leu Phe Glu Arg Phe Leu Asn Pro Glu Arg Val Ser Met Pro 
1 5 10 15



<210> 44 
<211> 15 
<212> PRT 

<213> Escherichia coli 
<400> 44 

Lys Phe Ala Gly Tyr Gly Phe Asn Lys Ser His Ser Ala Ala Tyr 
1 5 10 15



<210> 45 
<211> 44 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Description of Artificial Sequence: primer 
<400> 45 

cttctttttg aaagatttct aaataaagaa cgttattcaa tgcc 44 



<210> 46 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 46 

ataagctgca gcatgacttt tattaaaacc ataacctgca aattt 45 

<210> 47 
<211> 39 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 47 

agttaaaaat gccatatttt gacgtgtttt agttctaat 39



<210> 48 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 48 

cttgcaaaag cggttgctaa agatgttgga cgaattatgg gg 42 

<210> 49 
<211> 10 
<212> PRT 

<213> Escherichia coli
<400> 49

His Ala Tyr Leu Phe Ser Gly Pro Arg Gly
1 5 10



<210> 50 
<211> 10 
<212> PRT 

<213> Escherichia coli 
<400> 50 

His Ala Tyr Leu Phe Ser Gly Pro Arg Gly 
1 5 10



<210> 51
<211> 38
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 51

cgcggatccc atgcatattt attttcaggt ccaagagg 38

<210> 52
<211> 39
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 52

ccggaattct ggtggttctt ctaatgtttt taataatgc 39

WO 01/09164 PCT/US00/20666

<210> 53
<211> 38
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 53

tttgtaaagg cattacgcag gggactaatt cagatgtg 38



<210> 54 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 54 

tatgacattc attacaaggt tctccatcag tgc 33 



<210> 55 
<211> 29 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 55 

gagcactgat gaacttagaa ttagatatg 29



<210> 56 
<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 56 

gatactcagt atctttctca gatgttttat tc 32 



<210> 57 
<211> 14 
<212> PRT 

<213> Escherichia coli 
<400> 57 

Asp Leu Ile Ile Val Ala Ala Arg Pro Ser Met Gly Lys Thr
1 5 10



<210> 58 
<211> 15 
<212> PRT 

<213> Escherichia coli 
<400> 58 

Glu Ile Ile Ile Gly Lys Gln Arg Asn Gly Pro Ile Gly Thr Val
1 5 10 15



<210> 59 
<211> 41 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 59 

gaccttataa ttgtagctgc acgtccttct atgggaaaaa c 41 



<210> 60 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 60 

aacattatta agtcagcatc ttgttctatt gatccagatt caacgaag 48 

<210> 61 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 61 

gatttgtagt tctggtaatg ttgactcaaa ccgcttaaga accgg 45 



<210> 62
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 62 

atacgtgtgg ttaactgatc agcaacccat ctctagtgag aaaatacc 48 

<210> 63
<211> 31
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 63

cgttttaatg catgcttaga aacgatatca g 31

<210> 64
<211> 31
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 64

cattgctaag caacgttacg gtccaacagg c 31

<210> 65
<211> 69
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 65

ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatacccatg 60
gatgaacag 69

<210> 66
<211> 39
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 66

aattttaaag gatccgtgta taatattcta attttcccg 39



<210> 67 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 67 

gggagtttgt aatccatgga tgaacagc 28

<210> 68 
<211> 37 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 68

ctgaacacct attaccctag gcatctaact cacaccc 37

<210> 69 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 69 

ggagcagatt gcttttgata catatgattg gcctattc 38



<210> 70
<211> 46 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 70 

ttgtctccgc atcaaactgg gatccaagag catcatacgc gtatgg 46 

<210> 71
<211> 36
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 71

gcctaggata agggagggta catatggatt tagcgc 36

<210> 72
<211> 44
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 72

cgggcaagtc ttttgacaag cttcggatcc ccataacgaa ttcc 44

<210> 73
<211> 33
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 73

ggagttaaaa acatatgtat caagctcttt atc 33

<210> 74
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 74 

cgtgggtaag ggcaaaacgg atcccttatg tatttcag 38 

<210> 75
<211> 34
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 75

ggagttcata tgattcaatt ttcaaattaa tcgc 34

<210> 76
<211> 38
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 76

tatcagctcc tggatccagt accttccatt gattagcc 38

<210> 77
<211> 74
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 77

ggataacaat tccccgctag caataatttt gtttaacttt aagaaggaga tatacccatg 60
tcagatttat tcgc 74



<210> 78 
<211> 57 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 78 

cggtgtctct atctaaatga ctcatttggg atcctcgctt tatacggtat gtcacag 57 

<210> 79
<211> 43
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 79

gggaacaaga taaccaagga ggaacccatg gttgctcaac ttg 43

<210> 80
<211> 40
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 80

cgaatagcag cgttcatacc aggatcctcg ccgccactgg 40

<210> 81
<211> 49
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 81

accattttgg cttttaaagg tacggttaac agcaagtgtg aaggtagcc 49

I hereby claim the benefit under Title 35, United States Code, § 120 of any United States application(s) or PCT international application(s)
designating the United States of America that is/are listed below and, insofar as the subject matter of each of the claims of this application is not
disclosed in that/those prior application(s) in the manner provided by the first paragraph of Title 35, United States Code, § 112, I acknowledge

UNDER 35 U.S.C. 120:

U.S. APPLICATIONS

STATUS (Check One)

U.S. APPLICATION NUMBER

U.S. FILING DATE

FILING DATE



<210> 82 
<211> 38 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 82 

gaacgcgagg cagatttcat taactgtgtg atctggcg 38



<210> 83 
<211> 48 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 83 

tttaaaagag ggtagcatat gattaataat gtagtactag ttggtcgc 48



<210> 84 
<211> 51 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 84 

tttaaattta aacctaggtt caatccattc tgactagaat ggaagatcgt 